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HLA-A2.1 BINDING PEPTIDES AND THEIR USES 



Cross-Reference tn Related Ap plirahW 

[0001] The present application is a continuation in part of USSN 08/159,184, filed 29 
November 1993, Gray, et al. (abandoned), which is a continuation in part of USSN 
08/073,205, filed 4 June 1993, Gray et al. (abandoned), which is a continuation in part of 
08/027,146, filed 5 March 1993, Gray, et al. (abandoned). All of which are incorporated 

herein by reference. 

Technical Field 

[0002] The present invention relates to compositions and methods for preventing, treating 
or diagnosing a number of pathological states such as viral diseases and cancers. In 
particular, it provides novel peptides capable of binding selected major histocompatibility 
complex (MHC) molecules and inducing an immune response. 

Background Art 

[0003] MHC molecules are classified as either Class I or Class H molecules. Class II 
MHC molecules are expressed primarily on cells involved in initiating and sustaining 
immune responses, such as T lymphocytes, B lymphocytes, macrophages, etc. Class n MHC 
molecules are recognized by helper T lymphocytes and induce proliferation of helper T. 
lymphocytes and amplification of the immune response to the particular immunogenic 
peptide that is displayed. Class I MHC molecules are expressed on almost all nucleated cells 
and are recognized by cytotoxic T lymphocytes (CTLs), which then destroy the antigen- 
bearing cells. CTLs are particularly important in tumor rejection and in fighting viral 
infections. The CTL recognizes the antigen in the form of a peptide fragment bound to the 
MHC class I molecules rather than the intact foreign antigen itself. The antigen must 
normally be endogenously synthesized by the cell, and a portion of the protein antigen is 
degraded into small peptide fragments in the cytoplasm. Some of these small peptides 
translocate into a pre-Golgi compartment and interact with class I heavy chains to facilitate 
proper folding and association with the subunit oQ microglobulin. The peptide-MHC class I 
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complex is then routed to the cell surface for expression and potential recognition by specific 
CTLs. 

[0004] Investigations of the crystal structure of the human MHC class I molecule, 
HLA-A2.1, indicate that a peptide binding groove is created by the folding of the al and-o2 
domains of the class I heavy chain (Bjorkman et al., Nature 329:506 ( 1987). In these 
investigations, however, the identity of peptides bound to the groove was not determined: 

[0005] Buus et al., Science 242:1065 (1988) first described a method for acid elution of 
bound peptides from MHC. Subsequently, Rammensee and his coworkers (Falk et al., 
Nature 351:290 (1991) have developed an approach to characterize naturally processed 
peptides bound to class I molecules. Other investigators have successfully achieved direct 
amino acid sequencing of the more abundant peptides in various HPLC fractions by 
conventional automated sequencing of peptides eluted from class I molecules of the B type 
(Jardetzky, et al., Nature 353:326 (1991) and of the A2.1 type by mass spectrometry (Hunt, et 
al., Science 225:1261 (1992). A review of the characterization of naturally processed 
peptides in MHC Class I has been presented by Rotzschke and Falk (Rotzschke and Falk, 
Immunol. Today 12:447 (1QQ1) 

[0006] Sette et al., Proc. Natl. Acad Sri ttqa 86:3296 (1989) showed that MHC allele 
specific motifs could be used to predict MHC binding capacity. Schaeffer et al., Proc. Natl. 
Acad. Sci. USA 86:4649 (1989) showed that MHC binding was related to immunogenicity. 
Several authors (De Bruijn et al., Eur. J. Immunol 21 :2963-2970 (1991); Pamer et al., 991 
Nature 353:852-955 (1991)) have provided preliminary evidence that class I binding motifs 
can be applied to the identification of potential immunogenic peptides in animal models. 
Class I motifs specific for a number of human alleles of a given class I isotype have yet to be 
described. It is desirable that the combined frequencies of these different alleles should be 
high enough to cover a large fraction or perhaps the majority of the human outbred 
population. 

[0007] Despite the developments in the art, the prior art has yet to provide a useful human 
peptide-based vaccine or therapeutic agent based on this work. The present invention 
provides these and other advantages. 



Disclosure of the Invention 

[0008] The present invention provides compositions comprising immunogenic peptides 
having binding motifs for HLA-A2.1 molecules. The immunogenic peptides, which bind to 
the appropriate MHC allele, are preferably 9 to 10 residues in length and comprise conserved 
residues at certain positions such as positions 2 and 9. Moreover, the peptides do not 
comprise negative binding residues as defined herein at other positions such as positions 1, 3, 
6 and/or 7 in the case of peptides 9 amino acids in length and positions 1, 3, 4, 5, 7, 8 and/or 
9 in the case of peptides 10 amino acids in length. The present invention defines positions 
within a motif enabling the selection of peptides which will bind efficiently to HLA A2.1. 

[0009] Epitopes on a number of immunogenic target proteins can be identified using the 
peptides of the invention. Examples of suitable antigens include prostate cancer specific 
antigen (PSA), hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, 
Epstein-Barr virus antigens, human immunodeficiency type- 1 virus (HTV 1 ) and papilloma 
virus antigens. The peptides are thus useful in pharmaceutical compositions for both in vivo 
and ex vivo therapeutic and diagnostic applications. 

Definitions 

[0010] The term "peptide" is used interchangeably with "oligopeptide" in the present 
specification to designate a series of residues, typically L-amino acids, connected one to the 
other typically by peptide bonds between the alpha-amino and carbonyl groups of adjacent 
amino acids. The oligopeptides of the invention are less than about 15 residues in length and 
usually consist of between about 8 and about 1 1 residues, preferably 9 or 10 residues. 

[0011] An "immunogenic peptide" is a peptide which comprises an allele-specific motif 
such that the peptide will bind an MHC molecule and induce a CTL response. Immunogenic 
peptides of the invention are capable of binding to an appropriate HLA-A2.1 molecule and 
inducing a cytotoxic T cell response against the antigen from which the immunogenic 
peptide is derived. 

[0012] Immunogenic peptides are conveniently identified using the algorithms of the 
invention. The algorithms are mathematical procedures that produce a score which enables 
the selection of immunogenic peptides. Typically one uses the algorithmic score with a 
"binding threshold" to enable selection of peptides that have a high probability of binding at 
a certain affinity and will in turn be immunogenic. The algorithm is based upon either the . 

- 3 - 



effects on MHC binding of a particular amino acid at a particular position of a peptide or the 
effects on binding of a particular substitution in a motif containing peptide. 

(0013] A "conserved residue" is an amino acid which occurs in a significantly higher 
frequency than would be expected by random distribution at a particular position in a 
peptide. Typically a conserved residue is one where the MHC structure may provide a 
contact point with the immunogenic peptide. One to three, preferably two, conserved 
residues within a peptide of defined length defines a motif for an immunogenic peptide. 
These residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. Typically, an immunogenic peptide 
will comprise up to three conserved residues, more usually two conserved residues. 

[0014] As used herein, "negative binding residues" are amino acids which if present at 
certain positions (for example, positions 1, 3 and/or 7 of a 9-mer) will result in a peptide 
being a nonbinder or poor binder and in turn fail to be immunogenic i.e. induce a CTL 
response. 

[0015] The term "motif refers to the pattern of residues in a peptide of defined length, 
usually about 8 to about 1 1 amino acids, which is recognized by a particular MHC allele. 
The peptide motifs are typically different for each human MHC allele and differ in the 
pattern of the highly conserved residues and negative residues. 

[0016] The binding motif for an allele can be defined with increasing degrees of precision. 
In one case, all of the conserved residues are present in the correct positions in a peptide and 
there are no negative residues in positions 1, 3 and/or 7. 

[0017] The phrases "isolated" or "biologically pure" refer to material which is substantially 
or essentially free from components whi<;h normally accompany it as found in its native state. 
Thus, the peptides of this invention do not contain materials normally associated with their 
in situ environment, e.g., MHC I molecules on antigen presenting cells. Even where a 
protein has been isolated to a homogenous or dominant band, there are trace contaminants in 
the range of 5-10% of native protein which co-purify with the desired protein. Isolated 
peptides of this invention do not contain such endogenous co-purified protein. 

[0018] The term "residue" refers to an amino acid or amino acid mimetic incorporated in 
an oligopeptide by an amide bond or amide bond mimetic. 



Description of t he Preferred E mbodiments 

[0019] The present invention relates to the determination of allele-specific peptide motifs 
for human Class I MHC (sometimes referred to as HLA) allele subtypes, in particular, 
peptide motifs recognized by HLA-A2. 1 alleles. These motifs are then used to define T cell 
epitopes from any desired antigen, particularly those, associated with human viral diseases, 
cancers or autoiummune diseases, for which the amino acid sequence of the potential antigen 
or autoantigen targets is known. 

[0020] Epitopes on a number of potential target proteins can be identified in this manner. 
Examples of suitable antigens include prostate specific antigen (PSA), hepatitis B core and 
surface antigens (HBVc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, melanoma 
antigens (e.g., MAGE-1), human immunodeficiency virus (HIV) antigens and human 
papilloma virus (HPV) antigens. 

[0021] Autoimmune associated disorders for which the peptides of the invention may be 
employed to relieve the symptoms of, treat or prevent the occurrence or reoccurrence of 
include, for example, multiple sclerosis (MS), rheumatoid arthritis (RA), Sjogren syndrome, 
scleroderma, polymyositis, dermatomyositis, systemic lupus erythematosus, juvenile 
rheumatoid arthritis, ankylosing spondylitis, myasthenia gravis (MG), bullous pemphigoid 
(antibodies to basement membrane at dermal-epidermal junction), pemphigus (antibodies to 
mucopolysaccharide protein complex or intracellular cement substance), glomerulonephritis 
(antibodies to glomerular basement membrane), Goodpasture's syndrome, autoimmune 
hemolytic anemia (antibodies to erythrocytes), Hashimoto's disease (antibodies to thyroid), 
pernicious anemia (antibodies to intrinsic factor), idiopathic thrombocytopenic purpura 
(antibodies to platelets), Grave's disease, and Addison's disease (antibodies to thyroglobulin), 
and the like. 

[0022] The autoantigens associated with a number of these diseases have been identified. 
For example, in experimentally induced autoimmune diseases, antigens involved in 
pathogenesis have been characterized: in arthritis in rat and mouse, native type-H collagen is 
identified in collagen-induced arthritis, and mycobacterial heat shock protein in adjuvant 
arthritis; thyroglobulin has been identified in experimental allergic thyroiditis (EAT) in 
mouse; acetyl choline receptor (AChR) in experimental allergic myasthenia gravis (EAMG); 
and myelin basic protein (MB?) and proteolipid protein (PLP) in experimental allergic 
encephalomyelitis (EAE) in mouse and rat. In addition, target antigens have been identified 



in humans: type-H collagen in human rheumatoid arthritis; and acetyl choline receptor in 

myasthenia gravis. 

10023] Peptides comprising the epitopes from these antigens are synthesized and then 
tested for their ability to bind to the appropriate MHC molecules in assays using, for 
example, purified class I molecules and radioiodonated peptides and/or cells expressing 
empty class I molecules by, for instance, immunofluorescent staining and flow 
microfluorometry, peptide-dependent class I assembly assays, and inhibition of CTL 
recognition by peptide competition. Those peptides that bind to the class I molecule are 
further evaluated for their ability to serve as targets for CTLs derived from infected or 
immunized individuals, as well as for their capacity to induce primary in vitro or in vivo CTL 
responses that can give rise to CTL populations capable of reacting with virally infected 
target cells or tumor cells as potential therapeutic agents. 

[0024] The MHC class I antigens are encoded by the HLA-A, B, and C loci. HLA-A and 
B antigens are expressed at the cell surface at approximately equal densities, whereas the 
expression of HLA-C is significantly lower (perhaps as much as 10-fold lower). Each of 
these loci have a number of alleles. The peptide binding motifs of the invention are 
relatively specific for each allelic subtype. 

[0025] For peptide-based vaccines, the peptides of the present invention preferably 
comprise a motif recognized by an MHC I molecule having a wide distribution in the human 
population. Since the MHC alleles occur at different frequencies within different ethnic 
groups and races, the choice of target MHC allele may depend upon the target population. 
Table 1 shows the frequency of various alleles at the HLA-A locus products among different 
races. For instance, the majority of the Caucasoid population can be covered by peptides 
which bind to four HLA-A allele subtypes, specifically HLA-A2.1, Al, A3.2, and A24.1. 
Similarly, the majority of the Asian population is encompassed with the addition of peptides 
binding to a fifth allele HLA-A 1 1 .2. 



TABLE 1 



A Allele/Subtvpe 

Al 

A2.1 

A2.2 

A2.3 

A2.4 

A2.5 

A3.1 

A3 .2 
AIM 
A11.2 
A11.3 
A23 
A24 
A24.2 
A24.3 
A25 
A26.1 
A26.2 
A26V 
A28.1 
A28.2 
A29.1 
A29.2 
A30.1 
A30.2 
A30.3 
A31 
A32 
Aw33.1 
Aw33.2 
Aw34.1 
Aw34.2 
Aw36 



N(69)' 

10.1(7) 
11.5(8) 
10.1(7) 
1.4(1) 



1.4(1) 
5.7(4) 
0 

5.7(4) 
0 

4.3(3) 
2.9(2) 



1.4(1) 
4.3(3) 
7.2(5) 

10.1(7) 
1.4(1) 
1.4(1) 
10.1(7) 
8.6(6) 
1.4(1) 
7.2(5) 
4.3(3) 
2.8(2) 
8.6(6) 
2.8(2) 
1.4(1) 
14.5(10) 
5.9(4) 



Mm 

1.8(1) 
37.0(20) 
0 

5.5(3) 



0 

5.5(3) 
5.5(3) 
31.4(17) 
3.7(2) 

27.7(15) 



9.2(5) 
3.7(2) 



1.8(1) 



7.4(4) 



16.6(9) 



C(502) 

27.4(138) 
39.8(199) 
3.3(17) 
0.8(4) 



0.2(0) 
21.5(108) 
0 

8.7(44) 
0 

3.9(20) 
15.3(77) 



6.9(35) 
5.9(30) 
1.0(5) 

1.6(8) 

7.5(38) 

1.4(7) 

5.3(27) 

4.9(25) 

0.2(1) 

3.9(20) 
.6.9(35) 
7.1(36) 
2-5(13) 
1.2(6) 

0.8(4) 



Table compiled from B. DuPont, Immunobiologv of m A Vol. I, Histocompatibility Testing 1987 
Spnnger-Verlag, New York 1989. • 

* 

N - negroid; A = Asian; C = caucasoid. Numbers in 

parenthesis represent the number of individuals included in the analysis. 

[0026] The nomenclature used to describe peptide compounds follows the conventional 
practice wherein the amino group is presented to the left (the N-terminus) and the carboxyl 
group to the right (the C-terminus) of each amino acid residue. In the formulae representing 
selected specific embodiments of the present invention, the amino- and carboxyl-terminai 



groups, although not specifically shown, are in the form they would assume at physiologic 
pH values, unless otherwise specified. In the amino acid structure formulae, each residue is 
generally represented by standard three letter or single letter designations. The L-form of an 
amino acid residue is represented by a capital single letter or a capital first letter of a three- 
letter symbol, and the D-form for those amino acids having D-forms is represented by a 
lower case single letter or a lower case three letter symbol. Glycine has no asymmetric 
carbon atom and is simply referred to as "Gly" or G. 

Modes of Carrying Out the Invention 

[0027] The procedures used to identify peptides of the present invention generally follow 
the methods disclosed in Falk et al, Nature 351:290 (1991), which is incorporated herein by 
reference. Briefly, the methods involve large-scale isolation of MHC class I molecules, 
typically by immunoprecipitation or affinity chromatography, from the appropriate cell or 
cell line. Examples of other methods for isolation of the desired MHC molecule equally well 
known to the artisan include ion exchange chromatography, lectin chromatography, size 
exclusion, high performance ligand chromatography, and a combination of all of the above 
techniques. 

[0028] In the typical case, immunoprecipitation is used to isolate the desired allele. A 
number of protocols can be used, depending upon the specificity of the antibodies used. For 
example, allele-specific mAb reagents can be used for the affinity purification of the HLA-A, 
HLA-Bi, and HLA-C molecules. Several mAb reagents for the isolation of HLA-A 
molecules are available. The monoclonal BB7.2 is suitable for isolating HLA-A2 molecules. 
Affinity columns prepared with these mAbs using standard techniques are successfully used 
to purify the respective HLA-A allele products. 

[0029J In addition to allele-specific mAbs, broadly reactive anti-HLA-A, B, C mAbs, such 
as W6/32 and B9.12.1, and one anti-HLA-B, C mAb, Bl.23.2, could be used in alternative 
affinity purification protocols as described in the example section below. The peptides 
bound to the peptide binding groove of the isolated MHC molecules are eluted typically 
using acid treatment. Peptides can also be dissociated from class I molecules by a variety of 
standard denaturing means, such as heat, pH, detergents, salts, chaotropic agents, or a 
combination thereof Peptide fractions are further separated from the MHC molecules by 
reversed-phase high performance liquid chromatography (HPLC) and sequenced. Peptides 



can be separated by a variety of other standard means well known to the artisan, including 
filtration, ultrafiltration, electrophoresis, size chromatography, precipitation with specific 
antibodies, ion exchange chromatography, isoelectrofocusing, and the like. 

[0030] Sequencing of the isolated peptides can be performed according to standard 
techniques such as Edman degradation (Hunkapiller, M.W., et aL Methods Enzvmol. 91, 
399 [1983]). Other methods suitable for sequencing include mass spectrometry sequencing 
of individual peptides as previously described (Hunt, et al., Science 225:1261 (1992), which 
is incorporated herein by reference). Amino acid sequencing of bulk heterogenous peptides 
(e.g. , pooled HPLC fractions) from different class I molecules typically reveals a 
characteristic sequence motif for each class I allele. 

[0031] Definition of motifs specific for different class I alleles allows the identification of 
. potential peptide epitopes from an antigenic protein whose amino acid sequence is known. 
Typically, identification of potential peptide epitopes is initially carried out using a computer 
to scan the amino acid sequence of a desired antigen for the presence of motifs. The epitopic 
sequences are then synthesized. The capacity to bind MHC Class molecules is measured in a 
variety of different ways. One means is a Class I molecule binding assay as described in 
Example 8, below. Other alternatives described in the literature include inhibition of antigen 
presentation (Sette, et al., J. Immunol. 141:3893 (1991), in vitro assembly assays (Townsend, 
et al., CeU 62:285 (1990), and FACS based assays using mutated ells, such as RMA.S 
(Melief, et al., Eur. J. Immunol . 21:2963 (1991)). 

[0032] Next, peptides that test positive in the MHC class I binding assay are assayed for 
the ability of the peptides to induce specific CTL responses in vitro . For instance, antigen- 
presenting cells that have been incubated with a peptide can be assayed for the ability to 
induce CTL responses in responder cell populations. Antigen-presenting cells can be normal 
cells such as peripheral blood mononuclear cells or dendritic cells (Inaba, et al, J. Exp. Med. 
166:182 (1987); Boog, Eur. J. Immunol . 18:219 [1988]). 

[0033] Alternatively, mutant mammalian cell lines that are deficient in their ability to load 
class I molecules with internally processed peptides, such as the mouse cell lines RMA-S 
(Karre, et aL. Nature, 319:675 (1986); Ljunggren, et al., Eur. J. Immunol . 21:2963-2970 
(1991)), and the human somatic T cell hybrid, T-2 (Cerundolo, et al, Nature 345:449-452 
(1990)) and which have been transfected with the appropriate human class I genes are. 
conveniently used, when peptide is added to them, to test for the capacity of the peptide to 



induce in vitro primary CTL responses. Other eukaryotic cell lines which could be used 
include various insect cell lines such as mosquito larvae (ATCC cell lines CCL 125, 126, 
1660, 1591, 6585, 6586), silkworm (ATTC CRL 8851), armyworm (ATCC CRL 1711), 
moth (ATCC CCL 80) and Drosophila cell lines such as a Schneider cell line (see Schneider 
J. Embrvol. Exp. Morphol. 27:351-^ [1Q77]) 

[0034] Peripheral blood lymphocytes are conveniently isolated following simple 
venipuncture or leukapheresis of normal donors or patients and used as the responder cell 
sources of CTL precursors. In one embodiment, the appropriate antigen-presenting cells are 
incubated with 10-100 fiM of peptide in serum-free media for 4 hours under appropriate 
culture conditions. The peptide-loaded antigen-presenting cells are then incubated with their 
responder cell populations in vitro for 7 to 1 0 days under optimized culture conditions. 
Positive CTL activation can be determined by assaying the cultures for the presence of CTLs 
that kill radiolabeled target cells, both specific peptide-pulsed targets as well as target cells 
expressing endogenously processed form of the relevant virus or tumor antigen from which 
the peptide sequence was derived. 

[0035] Specificity and MHC restriction of the CTL is determined by testing against 
different peptide target cells expressing appropriate or inappropriate human MHC class I. 
The peptides that test positive in the MHC binding assays and give rise to specific CTL 
responses are referred to herein as immunogenic peptides. 

[0036] The immunogenic peptides can be prepared synthetically, or by recombinant DNA 
technology or from natural sources such as whole viruses or tumors. Although the peptide 
will preferably be substantially free of other naturally occurring host cell proteins and 
fragments thereof, in some embodiments the peptides can be synthetically conjugated to 
native fragments or particles. 

[0037] The polypeptides or peptides can be a variety of lengths, either in their neutral 
(uncharged) forms or in forms which are salts, and either free of modifications such as 
glycosylation, side chain oxidation, or phosphorylation or containing these modifications, 
subject to the condition that the modification not destroy the biological activity of the 
polypeptides as herein described. 

[0038] Desirably, the peptide will be as small as possible while still maintaining 
substantially all of the biological activity of the large peptide. When possible, it may be 
desirable to optimize peptides of the invention to a length of 9 or 10 amino acid residues, 



commensurate in size with endogenously processed viral peptides or tumor cell peptides that 
are bound to MHC class I molecules on the cell surface. 

[0039] Peptides having the desired activity may be modified as necessary to provide 
certain desired attributes, e.g., improved pharmacological characteristics, while increasing or 
at least retaining substantially all of the biological activity of the unmodified peptide to bind 
the desired MHC molecule and activate the appropriate T cell. For instance, the peptides 
may be subject to various changes, such as substitutions, either conservative or non- 
conservative, where such changes might provide for certain advantages in their use, such as 
improved MHC binding. By conservative substitutions is meant replacing an amino acid 
residue with another which is biologically and/or chemically similar, e.g., one hydrophobic 
residue for another, or one polar residue for another. The substitutions include combinations 
such as Gly, Ala; Val, He, Leu, Met; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 
The effect of single amino acid substitutions may also be probed using D-amino acids. Such 
modifications may be made using well known peptide synthesis procedures, as described in 

* 

e.g., Merrifield, Science 232:341-347 (1986), Barany and Merrifield, The Peptides . Gross 
and Meienhofer, eds. (N.Y., Academic Press), pp. 1-284 (1979); and Stewart and Young, 
Solid Phase Peptide Synthesis . (Rockford, 111., Pierce), 2d Ed. (1984), incorporated by 
reference herein. 

[0040] The peptides can also be modified by extending or decreasing the compound's 
amino acid sequence, e.g., by the addition or deletion of amino acids. The peptides or 
analogs of the invention can also be modified by altering the order or composition of certain 
residues, it being readily appreciated that certain amino acid residues essential for biological 
activity, e.g., those at critical contact sites or conserved residues, may generally not be altered 
without an adverse effect on biological activity. The non-critical amino acids need not be 
limited to those naturally occurring in proteins, such as L-oc-amino acids, or their D-isomers, 

but may include non-natural amino acids as well, such as B-5-8-amino acids, as well as many 
derivatives of L-a-amino acids. 

[0041] Typically, a series of peptides with single amino acid substitutions are employed to 
determine the effect of electrostatic charge, hydrophobicity, etc. on binding. For instance, a 
series of positively charged (e.g., Lys or Arg) or negatively charged (e.g., Glu) amino acid 
substitutions are made along the length of the peptide revealing different patterns of 
sensitivity towards various MHC molecules and T cell receptors. In addition, multiple 



substitutions using small, relatively neutral moieties such as Ala, Gly, Pro, or similar 
residues may be employed. The substitutions may be homo-oligomers or hetero-oligomers. 
The number and types of residues which are substituted or added depend on the spacing 
necessary between essential contact points and certain functional attributes which are sought 
(e.g., hydrophobicity versus hydrophilicity). Increased binding affinity for an MHC molecule 
or T cell receptor may also be achieved by such substitutions, compared to the affinity of the 
parent peptide. In any event, such substitutions should employ amino acid residues or other 
molecular fragments chosen to avoid, for example, steric and charge interference which 
might disrupt binding. 

[0042] Amino acid substitutions are typically of single residues. Substitutions, deletions, 
insertions or any combination thereof may be combined to arrive at a final peptide. 
Substitutional variants are those in which at least one residue of a peptide has been removed 
and a different residue inserted in its place. Such substitutions generally are made in 
accordance with the following Table 2 when it is desired to finely modulate the 
characteristics of the peptide. 



TABLE 2 



Original Residue 
Ala 

Arg 
Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 
He 
Leu 
Lys 
Met 
Phe 
Ser 
Thr 
Trp 
Tyr 
' Val 



Exemplary Substitution 

Ser 
Lys, His 

Gin 

Glu 

Ser 

Asn 

Asp 

Pro 
Lys; Arg 
Leu; Val 
lie; Val 
Arg; His 
Leu; He 
Tyr;Trp 
Thr 
Ser 
Tyr; Phe 
Trp; Phe 
He; Leu 
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[0043] Substantial changes in function (e.g., affinity for MHC molecules or T cell receptors) 
are made by selecting substitutions that are less conservative than those in Table 2, i.e., selecting 
residues that differ more significantly in their effect on maintaining (a) the structure of the 
peptide backbone in the area of the substitution, for example as a sheet or helical conformation, 
(b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes in peptide 
properties will be those in which (a) hydrophilic residue, e.g. seryl, is substituted for (or by) a 
hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a residue having an 
electropositive side chain, e.g., lysl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g. glutamyl or aspartyl; or (c) a residue having a bulky side chain, e.g. 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 

[0044] The peptides may also comprise isosteres of two or more residues in the immunogenic 
peptide. An isostere as defined here is a sequence of two or more residues that can be substituted 
for a second sequence because the steric conformation of the first sequence fits a binding site 
specific for the second sequence. The term specifically includes peptide backbone modifications 
well known to those skilled in the art. Such modifications include modifications of the amide 
nitrogen, the a-carbon, amide carbonyl, complete replacement of the amide bond, extensions, 

deletions or backbone crosslinks. See, generally . Spatola, Chemistry and Biochemistry of Amino 
Acids, peptides and Proteins. Vol. VII (Weinstein ed., 1983). 

[0045] Modifications of peptides with various amino acid mimetics or unnatural amino acids 
are particularly useful in increasing the stability of the peptide in vivo . Stability can be assayed 
in a number of ways. For instance, peptidases and various biological media, such as human 
plasma and serum, have been used to test stability. See, e^, Verhoef et al., Eur. J. Drue Metab. 
Pharmacokin. 11:291-302(1986). Half life of the peptides of the present invention is 
conveniently determined using a 25% human serum (v/v) assay. The protocol is generally as 
follows. Pooled human serum (Type AB, non-heat inactivated) is delipidated by centrifugation 
before use. The serum is then diluted to 25% with RPMQ tissue culture media and used to test 
peptide stability. At predetermined time intervals a small amount of reaction solution is removed 
and added to either 6% aqueous trichloracetic acid or ethanol. The cloudy reaction sample is 
cooled (4°C) for 15 minutes and then spun to pellet the precipitated serum proteins. The 
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presence of the peptides is then determined by reversed-phase HPLC using stability-specific 
chromatography conditions. 

10046] The peptides of the present invention or analogs thereof which have CTL stimulating 
activity may be modified to provide desired attributes other than improved serum half life. For 
instance, the ability of the peptides to induce CTL activity can be enhanced by linkage to a 
sequence which contains at least one epitope that is capable of inducing a T helper cell response. 
Particularly preferred immunogenic peptides/T helper conjugates are linked by a spacer 
molecule. The spacer is typically comprised of relatively small, neutral molecules, such as amino 
acids or amino acid mimetics, which are substantially uncharged under physiological conditions. 
The spacers are typically selected from, e.g., Ala, Gly, or other neutral spacers of nonpolar 
amino acids or neutral polar amino acids. It will be understood that the optionally present spacer 
need not be comprised of the same residues and thus may be a hetero- or homo-oligomer. When 
present, the spacer will usually be at least one or two residues, more usually three to six residues. 
Alternatively, the CTL peptide may be linked to the T helper peptide without a spacer. 
The immunogenic peptide may be linked to the T helper peptide either directly or via a spacer 
either at the amino or carboxy terminus of the CTL peptide. The amino terminus of either the 
immunogenic peptide or the T helper peptide may be acylated. Exemplary T helper peptides 
include tetanus toxoid 830-843, influenza 307-319, malaria circumsporozoite 382-398 and 
378-389. 

[0047] In some embodiments it may be desirable to include in the pharmaceutical 
compositions of the invention at least one component which primes CTL. Lipids have been 
identified as agents capable of priming CTL in vivo against viral antigens. For example, palmitic 
acid residues can be attached to the alpha and epsilori amino groups of a Lys residue and then 
linked, e.g., via one or more linking residues such as Gly, Gly-Gly-, Ser, Ser-Ser, or the like, to 
an immunogenic peptide. The lipidated peptide can then be injected directly in a micellar form, 
incorporated into a liposome or emulsified in an adjuvant, e.g., incomplete Freund's adjuvant. In 
a preferred embodiment a particularly effective immunogen comprises palmitic acid attached to 
alpha and epsilon amino groups of Lys, which is attached via linkage, e.g., Ser-Ser, to the amino 
terminus of the immunogenic peptide. 



[0048] As another example of lipid priming of CTL responses, E. coli lipoproteins, such as 
tripalmitoyl-S-glycerylcysteinlyseryl-serine (P 3 CSS) can be used to prime virus specific CTL 
when covalently attached to an appropriate peptide. See, Deres et al., Nature 342:561-564 
(1989), incorporated herein by reference. Peptides of the invention can be coupled to P 3 CSS, for 
example, and the lipopeptide administered to an individual to specifically prime a CTL response 
to the target antigen. Further, as the induction of neutralizing antibodies can also be primed with 
P3CSS conjugated to a peptide which displays an appropriate epitope, the two compositions can 
be combined to more effectively elicit both humoral and cell-mediated responses to infection. 

[0049] In addition, additional amino acids can be added to the termini of a peptide to provide 
for ease of linking peptides one to another, for coupling to a carrier support, or larger peptide, for 
modifying the physical or chemical properties of the peptide or oligopeptide, or the like. Amino 
acids such as tyrosine, cysteine, lysine, glutamic or aspartic acid, or the like, can be introduced at 
the C- or N-terminus of the peptide or oligopeptide. Modification at the C terminus in some 
cases may alter bindingcharacteristics of the peptide. In addition, the peptide or oligopeptide 
sequences can differ from the natural sequence by being modified by terminal-NH 2 acylation, 
e.g., by alkanoyl (C1-C20) or thioglycolyl acetylation, terminal-carboxyl amidation, e.g., 
ammonia, methylamine, etc. In some instances these modifications may provide sites for linking 
to a support or other molecule. 

[0050] The peptides of the invention can be prepared in a wide variety of ways. Because of 
their relatively short size, the peptides can be synthesized in solution or on a solid support in 
accordance with conventional techniques. Various automatic synthesizers are commercially 
available and can be used in accordance with known protocols. See, for example, Stewart and 
Young, Solid Phase Peptide Synthesis. 2d. ed., Pierce Chemical Co. (1984), supra . 

[0051] Alternatively, recombinant DNA technology may be employed wherein a nucleotide 
sequence which encodes an immunogenic peptide of interest is inserted into an expression 
vector, transformed or transfected into an appropriate host cell and cultivated under conditions 
suitable for expression. These procedures are generally known in the art, as described generally 
in Sambrook et al., Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Press, Cold 
Spring Harbor, New York (1 982), which is incorporated herein by reference. Thus, fusion 
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proteins which comprise one or more peptide sequences of the invention can be used to present 
the appropriate T cell epitope. 

(0052] As the coding sequence for peptides of the length contemplated herein can be 
synthesized by chemical techniques, for example, the phosphotriester method of Matteucci et al., 

_ i 

J. Am. Chem. Soc. 103:3185 (1981), modification can be made simply by substituting the 
appropriate base(s) for those encoding the native peptide sequence. The coding sequence can 
then be provided with appropriate linkers and ligated into expression vectors commonly available 
in the art, and the vectors used to transform suitable hosts to produce the desired fusion protein. 
A number of such vectors and suitable host systems are now available. For expression of the 
fusion proteins, the coding sequence will be provided with operably linked start and stop codons, 
promoter and terminator regions and usually a replication system to provide an expression vector 
for expression in the desired cellular host. For example, promoter sequences compatible with 
bacterial hosts are provided in plasmids containing convenient restriction sites for insertion of the 
desired coding sequence. The resulting expression vectors are transformed into suitable bacterial 
hosts. Of course, yeast or mammalian cell hosts may also be used, employing suitable vectors 
and control sequences. 

[0053] The peptides of the present invention and pharmaceutical and vaccine compositions 
thereof are useful for administration to mammals, particularly humans, to treat and/or prevent 
viral infection and cancer. Examples of diseases which can be treated using the immunogenic 
peptides of the invention include prostate cancer, hepatitis B, hepatitis C, AIDS, renal carcinoma, 
cervical carcinoma, lymphoma, CMV and condlyloma acuminatum. 

[0054] For pharmaceutical compositions, the immunogenic peptides of the invention are 
administered to an individual already suffering from cancer or infected with the virus of interest. 
Those in the incubation phase or the acute phase of infection can be treated with the 
immunogenic peptides separately or in conjunction with other treatments, as appropriate. In 
therapeutic applications, compositions are administered to a patient in an amount sufficient to 
elicit an effective CTL response to the virus or tumor antigen and to cure or at least partially 
arrest symptoms and/or complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the peptide 
composition, the manner of administration, the stage and severity of the disease being treated, the 



weight and general state of health of the patient, and the judgment of the prescribing physician, 
but generally range for the initial immunization (that is for therapeutic or prophylactic 
administration) from about 1 .0 fig to about 5000 ^g of peptide for a 70 kg patient, followed by 
boosting, dosages of from about 1.0 /xg to about 1000 /xg of peptide pursuant to a boosting 
regimen over weeks to months depending upon the patient's response and condition by measuring 
specific CTL activity in the patient's blood. It must be kept in mind that the peptides and 
compositions of the present invention may generally be employed in serious disease states, that 
is, life-threatening or potentially life threatening situations. In such cases, in view of the 
minimization of extraneous substances and the relative nontoxic nature of the peptides, it is 
possible and may be felt desirable by the treating physician to administer substantial excesses of 
these peptide compositions. 

[0055] For therapeutic use, administration should begin at the first sign of viral infection or the 
detection or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for a 
period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 

[0056] Treatment of an infected individual with the compositions of the invention may hasten 
resolution of the infection in acutely infected individuals. For those individuals susceptible (or 
predisposed) to developing chronic infection the compositions are particularly useful in methods 
for preventing the evolution from acute to chronic infection. Where the susceptible individuals 
are identified prior to or during infection, for instance, as described herein, the composition can 
be targeted to them, minimizing need for administration to a larger population. 

[0057] The peptide compositions can also be used for the treatment of chronic infection and to 
stimulate the immune system to eliminate virus-infected cells in carriers. It is important to 
provide an amount of immuno-potentiating peptide in a formulation and mode of administration 
sufficient to effectively stimulate a cytotoxic T cell response. Thus, for treatment of chronic 
infection, a representative dose is in the range of about 1.0 /xg to about 5000 /xg, preferably about 
5 /xg to 1000 /xg for a 70 kg patient per dose. Immunizing doses followed by boosting doses at 

established intervals, e.g., from one to four weeks, may be required, possibly for a prolonged 
period of time to effectively immunize an individual. In the case of chronic infection, 
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administration should continue until at least clinical symptoms or laboratory tests indicate that 
the viral infection has been eliminated or substantially abated and for a period thereafter. 

[0058] The pharmaceutical compositions for therapeutic treatment are intended for parenteral, 
topical, oral or local administration. Preferably, the pharmaceutical compositions are 
administered parenterally, e.g., intravenously, subcutaneously, intradermally, or intramuscularly. 
Thus, the invention provides compositions for parenteral administration which comprise a 
solution of the immunogenic peptides dissolved or suspended in an acceptable carrier, preferably 
an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, buffered water, 0.8% 
saline, 0.3% glycine, hyaluronic acid and the like. These compositions may be sterilized by 
conventional, well known sterilization techniques, or may be sterile filtered. The resulting 
aqueous solutions may be packaged for use as is, or lyophilized, the lyophilized preparation 
being combined with a sterile solution prior to administration. The compositions may contain 
pharmaceutical^ acceptable auxiliary substances as required to approximate physiological 
conditions, such as pH adjusting and buffering agents, tonicity adjusting agents, wetting agents 
and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium chloride, 
calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

[0059] The concentration of CTL stimulatory peptides of the invention in the pharmaceutical 
formulations can vary widely, i.e., from less than about 0.1%, usually at or at least about 2% to as 
much as 20% to 50% or more by weight, and will be selected primarily by fluid volumes, 
viscosities, etc., in accordance with the particular mode of administration selected. 

[0060] The peptides of the invention may also be administered via liposomes, which serve to 
target the peptides to a particular tissue, such as lymphoid tissue, or targeted selectively to 
infected cells, as well as increase the half-life' of the peptide composition. Liposomes include 
emulsions, foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, 
lamellar layers and the like. In these preparations the peptide to be delivered is incorporated as 
part of a liposome, alone or in conjunction with a molecule which binds to, e.g., a receptor . 
prevalent among lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, 
or with other therapeutic or immunogenic compositions. Thus, liposomes either filled or 
decorated with a desired peptide of the invention can be directed to the site of lymphoid cells, 
where the liposomes then deliver the selected therapeutic/immunogenic peptide compositions. 



Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as cholesterol. 
The selection of lipids is generally guided by consideration of, e.g., liposome size, acid lability 
and stability of the liposomes in the blood stream. A variety of methods are available for 
preparing liposomes, as described in, e.g., Szoka et al., Ann. Rev. Biophvs. Bioeng . 9:467 
(1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 5,019,369, incorporated herein by 
reference. 

[0061] For targeting to the immune cells, a ligand to be incorporated into the liposome can 
include, e.g., antibodies or fragments thereof specific for cell surface determinants of the desired 
immune system cells. A liposome suspension containing a peptide may be administered 
intravenously, locally, topically, etc. in a dose which varies according to. inter alia, the manner of 
administration, the peptide being delivered, and the stage of the disease being treated. 

[0062] For solid compositions, conventional nontoxic solid carriers may be used which 
include, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, 
sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the like. For 
oral administration, a pharmaceutically acceptable nontoxic composition is formed by 
incorporating any of the normally employed excipients, such as those carriers previously listed, 
and generally 10-95% of active ingredient, that is, one or more peptides of the invention, and 
more preferably at a concentration of 25%-75%. 

[0063] For aerosol administration, the immunogenic peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of peptides are 0.01%- 
20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, and preferably 
soluble in the propellant. Representative of such agents are the esters or partial esters of fatty 
acids containing from 6 to 22 carbon atoms, such as caproic, octanoic, lauric, palmitic, stearic, 
linoleic, linolenic, olesteric and oleic acids with an aliphatic polyhydric alcohol or its cyclic 
anhydride. Mixed esters, such as mixed or natural glycerides may be employed. The surfactant 
may constitute 0.1%-20% by weight of the composition, preferably 0.25-5%. The balance of the 
composition is ordinarily propellant. A carrier can also be included, as desired, as with, e.g., 
lecithin for intranasal delivery. 



[0064J In another aspect the present invention is directed to vaccines which contain as an 
active ingredient an immunogenically effective amount of an immunogenic peptide as described 
herein. The peptide(s) may be introduced into a host, including humans, linked to its own carrier 
or as a homopolymer or heteropolymer of active peptide units. Such a polymer has the advantage 
of increased immunological reaction and, where different peptides are used to make up the 
polymer, the additional ability to induce antibodies and/or CTLs that react with different 
antigenic determinants of the virus or tumor cells. Useful carriers are well known in the art, and 
include, e.g., thyro globulin, albumins such as human serum albumin, tetanus toxoid, polyamino 
acids such as poly(lysine: glutamic acid), influenza, hepatitis B virus core protein, hepatitis B 
virus recombinant vaccine and the like. The vaccines can also contain a physiologically tolerable 
(acceptable) diluent such as water, phosphate buffered saline, or saline, and further typically 
include an adjuvant. Adjuvants such as incomplete Freund's adjuvant, aluminum phosphate, 
aluminum hydroxide, or alum are materials well known in the art. And, as mentioned above, 
CTL responses can be primed by conjugating peptides of the invention to lipids, such as P3CSS. 
Upon immunization with a peptide composition as described herein, via injection, aerosol, oral, 
transdermal or other route, the immune system of the host responds to the vaccine by producing 
large amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

[0065] Vaccine compositions containing the peptides of the invention are administered to a 
patient susceptible to or otherwise at risk of viral infection or cancer to elicit an immune response 
against the antigen and thus enhance the patient's own immune response capabilities. Such an 
amount is defined to be an "immunogenically effective dose." In this use, the precise amounts 
again depend on the patient's state of health and weight, the mode of administration, the nature of 
the formulation, etc., but generally range from about 1.0 fig to about 5000 fig per 70 kilogram 
patient, more commonly from about 10 fig to about 500 fig mg per 70 kg of body weight. 

[0066] In some instances it maybe desirable to combine the peptide vaccines of the invention 
with vaccines which induce neutralizing antibody responses to the virus of interest, particularly 
to viral envelope antigens. 

[0067] For therapeutic or immunization purposes, the peptides of the invention can also be 
expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use 



of vaccinia vims as a vector to express nucleotide sequences that encode the peptides of the 
invention. Upon introduction into an acutely or chronically infected host or into a non-infected 
host, the recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits a 
host CTL response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848, incorporated herein by reference. Another vector is 
BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et al. ( Nature 35 1 :456- 
460 (1991)) which is incorporated herein by reference. A wide variety of other vectors useful for 
therapeutic administration or immunization of the peptides of the invention, e.g., Salmonella 
typhi vectors and the like, will be apparent to those skilled in the art from the description herein. 

[0068] Antigenic peptides may be used to elicit CTL ex vivo , as well. The resulting CTL, can 
be used to treat chronic infections (viral or bacterial) or tumors in patients that do not respond to 
other conventional forms of therapy, or will not respond to a peptide vaccine approach of 
therapy. Ex vivo CTL responses to a particular pathogen (infectious agent or tumor antigen) are 
induced by incubating in tissue culture the patient's CTL precursor cells (CTLp) together with a 
source of antigen-presenting cells (APC) and the appropriate immunogenic peptide. After an 
appropriate incubation time (typically 1-4 weeks), in which the CTLp are activated and mature 
and expand into effector CTL, the cells are infused back into the patient, where they will destroy 
their specific target cell (an infected cell or a tumor cell). 

[0069] The peptides may also find use as diagnostic reagents. For example, a peptide of the 
invention may be used to determine the susceptibility of a particular individual to a treatment 
regimen which employs the peptide or related peptides, and thus may be helpful in modifying an 
existing treatment protocol or in determining a prognosis for an affected individual. In addition, 
the peptides may also be used to predict which individuals will be at substantial risk for 
developing chronic infection. 

[0070] The following examples are offered by way of illustration, not by way of limitation. 



Example 1 



Class I antigen isolation 

[0071] A flow diagram of an HLA-A antigen purification scheme is presented in Figure 1. 
Briefly, the cells bearing the appropriate allele were grown in large batches (6-8 liters yielding ~5 
x 109 cells), harvested by centrifugation and washed. All cell lines were maintained in RPMI 
1640 media (Sigma) supplemented with 10% fetal bovine serum (FBS) and antibiotics. For 
large-scale cultures, cells were grown in roller bottle culture in RPMI 1640 with 10% FBS or 
with 10% horse serum and antibiotics. Cells were harvested by centrifugation at 1 500 RPM EEC- 
CRU5000 centrifuge with 259 rotor and washed three times with phosphate-buffered saline 
(PBS) (0.01 M P04, 0.154 M NaCl, pH 7.2). 

[0072] Cells were pelleted and stored at -70°C or treated with detergent lysing solution to 
prepare detergent lysates. Cell lysates were prepared by the addition of stock detergent solution 
[1% NP-40 (Sigma) or Renex 30 (Accurate Chem. Sci. Corp., Westbury, NY 1 1590), 150 mM 
NaCl, 50 mM Tris, pH 8.0] to the cell pellets (previously counted) at a ratio of 50-100 x 10 6 cells 
per ml detergent solution. A cocktail of protease inhibitors was added to the premeasured 
volume of stock detergent solution immediately prior to the addition to the cell pellet. Addition 
of the protease inhibitor cocktail produced final concentrations of the following: 
phenylmethylsulfonyl fluoride (PMSF), 2 mM; aprotinin, 5 /ig/ml; leupeptin, 10 fig/ml; 
pepstatin, 10 /ig/ml; iodoacetamide, 100 fiM; and EDTA, 3 ng/ml. Cell lysis was allowed to 
proceed at 4°C for 1 hour with periodic mixing. Routinely 5-10 x 10 9 cells were lysed in 50-100 
ml of detergent solution. The lysate was clarified by centrifugation at 15,000 x g for 30 minutes 
at 4°C and subsequent passage of the supernatant fraction through a 0.2 M filter unit (Nalgene). 

[0073] The HLA-A antigen purification was achieved using affinity columns prepared with 
mAb-conjugated Sepharose beads. For antibody production, cells were grown in RPMI with 
10% FBS in large tissue culture, flasks (Corning 25160-225). Antibodies were purified from 
clarified tissue culture medium by ammonium sulfate fractionation followed by affinity 
chromatography on protein-A-Sepharose (Sigma). Briefly, saturated ammonium sulfate was 
added slowly with stirring to the tissue culture supernatant to 45% (volume to volume) overnight 
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at 4°C to precipitate the immunoglobulins. The precipitated proteins were harvested by 
centrifugation at 10,000 x g for 30 minutes. The precipitate was then dissolved in a minimum 
volume of PBS and transferred to dialysis tubing (Spectro/Por 2, Mol. wt. cutoff 12,000-14,000, 
Spectum Medical Ind.). Dialysis was against PBS ( ^£0 times the protein solution volume) with 
4-6 changes of dialysis buffer over a 24-48 hour period at 4°C. The dialyzed protein solution was 
clarified by centrifugation (10,000 x g for 30 minutes) and the pH of the solution adjusted to pH 
8.0 with IN NaOH. Protein- A-Sepharose (Sigma) was hydrated according to the manufacturer's 
instructions, and a protein- A-Sepharose column was prepared. A column of 1 0 ml bed volume 
typically binds 50-100 mg of mouse IgG. 

[0074] The protein sample was loaded onto the protein-A-Sepharose column using a peristaltic 
pump for large loading volumes or by gravity for smaller volumes (<100 ml). The column was 
washed with several volumes of PBS, and the eluate was monitored at A280 in a 
spectrophotometer until base line was reached. The bound antibody was eluted using 0. 1 M 
citric acid at suitable pH (adjusted to the appropriate pH with IN NaOH). For mouse IgG-1 pH 
6.5 was used for IgG2a pH 4.5 was used and for IgG2b and IgG3 pH 3.0 was used. 2 M Tris base 
was used to neutralize the eluate. Fractions containing the antibody (monitored by A280) were 
pooled, dialyzed against PBS and further concentrated using an Amicon Stirred Cell system 
(Amicon Model 8050 with YM30 membrane). The anti-A2 mAb, BB7.2, was useful for affinity 
purification. 

[0075] The HLA-A antigen was purified using affinity columns prepared with mAb- 
conjugated Sepharose beads. The affinity columns were prepared by incubating protein-A- 
Sepharose beads (Sigma) with affinity-purified mAb as described above. Five to 10 mg of mAb 
per ml of bead is the preferred ratio. The mAb bound beads were washed with borate buffer 
(borate buffer: 100 mM sodium tetraborate, 154 mM NaCl, pH 8.2) until the washes show A280 
at based line. Dimethyl pimelimidate (20 mM) in 200 mM triethanolamine was added to 
covalently crosslink the bound mAb to the protein- A-Sepharose (Schneider et al., J. Biol. Chem. 
257:10766 (1982). After incubation for 45 minutes at room temperature on a rotator, the excess 
crosslinking reagent was removed by washing the beads twice with 10-20 ml of 20 mM 
ethanolamine, pH 8.2. Between each one the slurry was placed on a rotator for 5 minutes at room 
temperature. The beads were washed with borate buffer and with PBS plus 0.02% sodium azide. 



100761 The cell lysate (5-10 x 10 9 cell equivalents) was then slowly passed over a 5-10 ml 
affinity column (flow rate of 0.1-0.25 ml per minute) to allow the binding of the antigen to the 
immobilized antibody. After the lysate was allowed to pass through the column, the column was 
washed sequentially with 20 column volumes of detergent stock solution plus 0.1% sodium 
dodecyl sulfate, 20 column volumes of 0.5 M NaCl, 20 mM Tris, pH 8.0, and 10 column 
volumes of 20 mM Tris, pH 8.0. The HLA-A antigen bound to the mAb was eluated with a basic 
buffer solution (50 mM diethylamine in water). As an alternative, acid solutions such as 0.15- 
0.25 M acetic acid were also used to elute the bound antigen. An aliquot of the eluate (1/50) was 
removed for protein quantification using either a colorimetric assay (BCA assay, Pierce) or by 
SDS-PAGE, or both. SDS-PAQE analysis was performed as described by Laemmli (Laemmli, 
U.K., Nature 227:680 (1970)) using known amounts of bovine serum albumin (Sigma) as a 
protein standard. Allele specific antibodies were used to purify the specific MHC molecule. In 
the case of HLA-A2, the mAb BB7.2 was used. 

Example 2 

Isolation and sequencing of naturally processed peptides 

[0077] For the HLA-A preparations derived from the base (50 mM diethylamine) elution 
protocol, the eluate was immediately neutralized with 1 N acetic acid to pH 7.0-7.5. The 
neutralized eluate was concentrated to a volume of 1-2 ml in an Amicon stirred cell [Model 
8050, with YM3 membranes (Amicon)]. Ten ml of ammonium acetate (0.01 M, pH 8.0) was 
added to the concentrator to remove the non-volatile salts, and the sample was concentrated to 
approximately 1 ml. A small sample (1/50) was removed for protein quantitation as described 
above. The remainder was recovered into a 15 ml polypropylene conical centrifuge tube (Falcon, 
2097) (Becton Dickinson). Glacial acetic acid was added to obtain a final concentration of 10% 
acetic acid. The acidified sample was placed in a boiling water bath for 5 minutes to allow for 
the dissociation of the bound peptides. The sample was cooled on ice, returned to the 
concentrator and the filtrate was collected. Additional aliquots of 10% acetic acid (1-2 ml) were 
added to the concentrator, and this filtrate was pooled with the original filtrate. Finally, 1-2 ml of 
distilled water was added to the concentrator, and this filtrate was pooled as well. 



[0078] The retentate contains the bulk of the HLA-A heavy chain and ^-microglobulin, while 
the filtrate contains the naturally processed bound peptides and other components with molecular 
weights less than about 3000. The pooled filtrate material was lyophilized in order to 
concentrate the peptide fraction. The sample was then ready for further analysis. 

[0079] For HPLC (high performance liquid chromatography) separation of the peptide 
fractions, the lyophilized sample was dissolved in 50 /d of distilled water, or into 0.1% 
trifluoracetic acid (TFA) (Applied Biosystems) in water and injected to a C18 reverse-phase 
narrow bore column (Beckman CI 8 Ultrasphere, 10 x 250 mm), using a gradient system 
described by Stone and Williams (Stone, K.L. and Williams K.R., in, Macromolecular 
Sequencing and Synthesis; Selected Methods and Applications, A.R. Liss, New York, 1988, pp. 
7-24. Buffer A was 0.06% TFA in water (Burdick- Jackson) and buffer B was 0.052% TFA in 
80% acetonitrile (Burdick- Jackson). The flow rate was 0.250 ml/minute with the following 
gradient: 060 rhin., 2-37.5% B; 60-95 min., 37.5-75% B; 95-105 min., 75-98% B. The Gilson 
narrow bore HPLC configuration is particularly useful for this purpose, although other 
configurations work equally well. 

[0080] A large number of peaks were detected by absorbance at 214 nm, many of which 
appear to be of low abundance. Whether a given peak represents a single peptide or a peptide 
mixture was not determined. Pooled fractions were then sequenced to determine motifs specific 
for each allele as described below. 

[0081] Pooled peptide fractions, prepared as described above were analyzed by automated 
Edman sequencing using the Applied Biosystems Model 477A automated sequencer. The 
sequencing method is based on the technique developed by Pehr Edman in the 1950s for the 
sequential degradation of proteins and peptides to determine the sequence of the constituent 
amino acids. 

[0082] The protein or peptide to be sequenced was held by a 1 2-mm diameter porous glass 
fiber filter disk in a heated, argon-purged reaction chamber. The filter was generally pretreated 
with BioBrene Plus™ and then cycled through one or more repetitions of the Edman reaction to 
reduce contaminants and improve the efficiency of subsequent sample sequencing. Following 
the pre-treatment of the filter, a solution of the sample protein or peptide (10 pmol-5 nmol range) 
was loaded onto the glass filter and dried. Thus, the sample was left embedded in the film of the 



pre-treated disk. Covalent attachment of the sample to the filter was usually not necessary 
because the Edman chemistry utilized relatively apolar solvents, in which proteins and peptides 
are poorly soluble. 

[0083] Briefly, the Edman degradation reaction has three steps: coupling, cleavage, and 
conversion. In coupling step, phenylisothiocyanate (PITC) is added. The PITC reacts 
quantitatively with the free amino-terminal amino acid of the protein to form the 
phenylthiocarbamyl-protein in a basic environment. After a period of time for the coupling step, 
the excess chemicals are extracted and the highly volatile organic acid, trifluoroacetic acid, TFA, 
is used to cleave the PITC-coupled amino acid residue from the amino terminus of the protein 
yielding the anilinothiazolinone (ATZ) derivative of the amino acid. The remaining 
protein/peptide is left with a new amino terminus and is ready for the next Edman cycle. The 
ATZ amino acid is extracted and transferred to a conversion flask, where upon addition of 25% 
TFA in water, the ATZ amino acid is converted to the more stable phenylthiohydantoin (PTH) 
amino acid that can be identified and quantified following automatic injection into the Model 
120 PTH Analyzer which uses a microbore C-18 reverse-phase HPLC column for the analysis. 

[0084] In the present procedures, peptide mixtures were loaded onto the glass filters. Thus a 
single amino acid sequence usually does not result. Rather, mixtures of amino acids in different 
yield are found. When the particular residue is conserved among the peptides being sequenced, 
increased yield for that amino acid is observed. 



Example 3 

Definition of an A2.1 specific motif 

[0085] In one case, pooled peptide fractions prepared as described in Example 2 above were 
obtained from HLA-A2.1 homozygous cell lines, for example, JY. The pooled fractions were 
HPLC fractions corresponding to 7% to 45% CH 3 CN. For this class I molecule, this region of 
the chromatogram was most abundant in peptides. Data from independent experiments were 
averaged as described below. 

[0086] The amino acid sequence analyses from four independent experiments were analyzed 
and the results are shown in Table 3. For each position except the first, the data were analyzed by 
modifying the method described by Falk et al., supra, to allow for comparison of experiments 
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from chfferent HLA types. This modified procedure yielded quantitative yet standardized values 
while allowing the averaging of data from different experiments involving the same HLA type. 

[0087] The raw sequenator data was converted to a simple matrix of 10 rows (each 
representing one Edman degradation cycle) and 16 columns (each representing one of the twenty 
amino acids; W, C, R and H were eliminated for technical reasons. The data corresponding to 
the first row (first cycle) was not considered further because, this cycle is usually heavily 
contaminated by free amino acids.). The values of each row were summed to yield a total pmoles 
value for that particular cycle. For each row, values for each amino acid were then divided by the 
corresponding total yield value, to determine what fraction of the total signal is attributable to 
each amino acid at each cycle. By doing so, an "Absolute Frequency" table was generated. This 
absolute frequency table allows correction for the declining yields of each cycle. 
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TABLE 3 



A2.1: POOL SEQUENCING FREQUENCY 



A 

G 
D 
E 
R 
H 
K 

L 
V 
I 

M 

Y 

F 

W 

Q 

N 

S 

T 

C 

P 




0.85 
1.29 



0.89 

0.45 
0.60 
0.53 
0.62 
0.65 
0.69 

* 

0.92 

1.17 
1.33 
0.04 



>os. 5 
0.95 
1.22 
1.03 
1.10 




0.57 
0.79 
0.93 
0.71 
1.32 
1.16 

* 

0.95 
1.28 
0.87 
0.95 




1.44 1.15 



significant Increase 



poa. 6 


poa. 7 


0.77 


1.21 


0.89 


0.78 


0.83 


0.82 


0.82 


1.06 


\ 1.09 


* 

0.89 


1.00 


0.69 


1.38 


1.24 


1.49 


1.15 


0.68 


0.83 


0.78 


1.34 


1.00 


1.07 


* 

0.90 


* 

1.16 ;i 


1.08 


1.07 


0.77 


0.71 


1.21 


1.07 | 


* 

1.09 


* 

1.30 



1.15 
0.98 
0.82 
0.87 




0.87 



1° 
1.25 

1.48 

1.19 

0.86 



* 


• 


0.82 


0.B7 


0.92 


0.77 




1.27 


"o.88 


0.54 


0.73 


0.22 


1.00 


0.79 


0.78 


0.73 


* 

1.00 


1,00 


0.96 


0.42 


0.77 


0.58 


0.71 


0J&7 


• 

0.81 


1.01 
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100881 Starting from the absolute frequency table, a "relative frequency" table was then 
generated to allow comparisons among different amino acids. To do so the data from each 
column was summed, and then averaged. Then, each value was divided next by the average 
column value to obtain relative frequency values. These values quantitate, in a standardized 
manner, increases and decreases per cycle, for each of the different sixteen amino acid types. 
Tables generated from data from different experiments can thus be added together to generate 
average relative frequency values (and their standard deviations). All standard deviations can 
then be averaged, to estimate a standard deviation value applicable to the samples from each 
table. Any particular value exceeding 1.00 by more than two standard deviations is considered to 
correspond to a significant increase. 



Example 4 

Quantitative Binding Assays 

[0089] Using isolated MHC molecules prepared as described in Example 2, above, 
quantitative binding assays were performed. Briefly, indicated amounts of MHC as isolated 
above were incubated in 0.05% NP40-PBS with ~5 nM of radiolabeled peptides in the presence 
of 1-3 [iM /3 2 M and a cocktail of protease inhibitors (final concentrations 1 mM PMSF, 1.3 mM 
1.10 Phenanthroline, 73 /M Pepstatin A, 8 mM EDTA, 200 fiM N-o>p-tosyl-L-Lysine 
Chloromethyl ketone). After various times, free and bound peptides were separated by TSK 
2000 gel filtration, as described previously in A. Sette et al., J. Immunol. 148:844 (1992), which 
is incorporated herein by reference. Peptides were labeled by the use of the Chloramine T 
method Buus et al., Science 235:1352 (1987), which is incorporated herein by reference. 

[0090] The HBc 18-27 peptide HLA binding peptide was radiolabeled and offered (5-10 nM) 
to 1 fM purified HLA A2.1. After two days at 23°C in presence of a cocktail of protease 
inhibitors and 1-3- /iM purified human 0 2 M, the percent of MHC class I bound radioactivity was 
measured by size exclusion chromatography, as previously described for class II peptide binding 
assays in Sette et al., in Seminars in Immunology, Vol. 3, Gefter, ed. (W.B. Saunders, 
Philadelphia, 1991), pp 195-202, which is incorporated herein by reference. Using this protocol, 
high binding (95%) was detected in all cases in the presence of purified HLA A2.1 molecules. 



[0091] To explore the specificity of binding, we determined whether the binding was 
inhabitable by excess unlabeled peptide, and if so, what the 50% inhibitory concentration 
(IC50%) might be. The rationale for this experiment was threefold. First, such an experiment is 
crucial in order to demonstrate specificity. Second, a sensitive inhibition assay is the most viable 
alternative for a high throughput quantitative binding assay. Third, inhibition data subjected to 
Scatchard analysis can give quantitative estimates of the equilibrium constant (K) of interaction 
and the fraction of receptor molecules capable of binding ligand (% occupancy). For instance, in 
analysis of an inhibition curve for the interaction of the peptide HBc 18-27 with A2.1, the IC50% 
was determined to be 25 nM. Further experiments were conducted to obtain Scatchard plots. 
For HBc 18-27/A2.1, six different experiments using six independent MHC preparations yielded 
a K D Of 1 5.5 ± 9.9 x 10" 9 M and occupancy values of 6.2% (±1 .4). 

[0092] Several reports have demonstrated that class I molecules, unlike class II, are highly 
selective with regard to the size of the peptide epitope that they recognize. The optimal size 
varies between 8 and 10 residues for different peptides and different class I molecules, although 
MHC binding peptides as long as 13 residues have been identified. To verify the stringent size 
requirement, a series of N- and C-terminal truncation/extension analogs of the peptide HBc 18- 
27 were synthesized and tested for A2.1 binding. Previous studies had demonstrated that the 
optimal size for CTL recognition of this peptide was the 10-mer HBcl8-27 (Sette et al. supra). It 
was found that removal or addition of a residue at the C terminus of the molecule resulted in a 30 
to 100-fold decrease in binding capacity. Further removal or addition of another-residue 
completely obliterated binding. Similarly, at the N-terminus of the molecule, removal or deletion 
of one residue from the optimal HBc 18-27 peptide completely abrogated A2.1 binding. 

[0093] Throughout this disclosure, results have been expressed in terms of IC50's. Given the 
conditions in which our assays are run (i.e., limiting MHC and labeled peptide concentrations), 
these values approximate Kd values. It should be noted that IC50 values can change, often 
dramatically, if the assay conditions are varied, and depending on the particular reagents used 
(e.g., Class I preparation, etc.). For example, excessive concentrations of MHC will increase the 
apparent measured IC50 of a given ligand. 

[0094] An alternative way of expressing the binding data, to avoid these uncertainties, is as a 
relative value to a reference peptide. The reference peptide is included in every assay. As a 



particular assay becomes more, or less, sensitive, the IC50's of the peptides tested may change 
somewhat. However, the binding relative to the reference peptide will not change. For example, 
in an assay run under conditions such that the IC50 of the reference peptide increases 10-fold, all 
IC50 values will also shift approximately ten-fold. Therefore, to avoid ambiguities, the 
assessment of whether a peptide is a good, intermediate, weak, or negative binder should be 
based on it's IC50, relative to the IC50 of the standard peptide. 

[0095] The reference peptide for the HLA-A2.1 assays described herein is referred to as 
941.01 having a sequence of FLPSDYFPSV. An average IC50 of 5 (nM) was observed under 
the assay conditions utilized. 

[00961 If the IC50 of the standard peptide measured in a particular assay is different from that 
reported in the table, then it should be understood that the threshold values used to determine 
good, intermediate, weak, and negative binders should be modified by a corresponding factor. 
For example, if in an A2.1 binding assay, the IC50 of the A2.1 standard (941.01) were to be 
measured as 8 nM instead of 5 nM, then a peptide ligand would be called a good binder only if it 
had an IC50 of less than 80 nM (i.e., 8nM x 0.1), instead of the usual cut-off value of 50 nM. 

Example 5 

HLA-A2.1 Binding Motif and Algorithm 

[0097] The structural requirements for peptide binding to A2.1 have been defined for both, 9- 
mer and 10-rrier peptides. Two approaches have been used. The first approach referred to as the 
"poly- A approach' 1 uses a panel of single amino acid substitutions of a 9-mer prototype poly-A 
binder (ALAKAAAAV) that is tested for A2.1 binding using the methods of Example 4 above to 
examine the degree of degeneracy of the anchor-positions and the possible influence of non- 
anchor positions on A2.1 binding. 

[0098] The second approach, the "Motif-Library approach", uses a large library of peptides 
selected from sequences of potential target molecules of viral and tumor origin and tested for 
A2. 1 binding using the methods in Example 4 above. The frequencies by which different amino- 
acids occurred at each position in good binders and non-binders were analyzed to further define 
the role of non-anchor positions in 9-mers and 10-mers. 



- 32 - 



A2.1 binding of peptide 9-mers 

[0099] Poly A Approach A poly-A 9-mer peptide, containing the A2.1 motif L (Leu) in 
position 2 and V (Val) in position 9 was chosen as a prototype binder. A K (Lys) residue was 
included in position 4 to increase solubility. A panel of 91 single amino-acid substitution 
analogues of the prototype parental 9-mer was synthesized and tested for A2.1 binding (Table 4). 
Shaded areas mark analogs with a greater than 10-fold reduction in binding capacity relative to 
the parental peptide. A reduction in binding greater than 100-fold is indicated a dash. 

[00100] Anchor-Positions 2 and 9 in poly-A Analogs The effect of single-amino-acid 
substitutions at the anchor positions 2 and 9 was examined first. Most substitutions in these 
positions had profound detrimental effects on binding capacity, thus confirming their role for 
binding. More specifically, in position 2 only L and M bound within a 10-fold range ("preferred 
residues"). Residues with similar characteristics, such as I, V, A, and T were tolerated, but 
bound 10 to 100-fold less strongly than the parental peptide. All the remaining substitutions 
(residues S, N, D, F, C, K, G, and P) were not tolerated and decreased binding by more than 100- 
fold. Comparably stringent requirements were observed for position 9, where V, L and I were 
preferred and A and M are tolerated, while the residues T, C, N, F, and Y virtually abolished 
binding. According to this set of peptides, an optimal 2-9 motif could be defined with L, M in 
position 2 and V, I, or L in position 9. 
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TABLE 4 



A2.1 binding of analogs of a motlf-contalnlng poly A peptide 




Ratio £0.1 



pos. 5 


DOS. 6 


pos. 7 


A 


A 


A 


1.00 


1.00 


1.00 


0.63 


0.12 




0.S1 






0.62 


0.15 


0.28 








0.39 




0.50 


0.99 




0.76 




0.44 


0.49 




1.12 




0.94 


0.41 


1.40 




1.76 




0.32 




0.19 


1.24 




0.97 






0.28 




0.84 






0.84 


0.63 



pos. B 

A 
1.00 
0.57 

0.26 

0.24 
0.11 
0.90 
0.30 



0.43 
0.49 

0.41 
0.31 

0.37 
0.5C 



pos. 9 
V 



0.11 
1.00 
0.18 



MS 



mm 



rf. ' .....J8UL. 



Ratio s 0.01 
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[00101] Non-Anchor Positions 1 and 3-8 in poly-A Analogs All non-anchor positions 
were more permissive to different substitutions than the anchor-positions 2 and 9, i.e. most 
residues were tolerated. Significant decreases in binding were observed for some substitutions in 
distinct positions. More specifically, in position 1 a negative charge (residues D and E) or a P 
greatly reduced the binding capacity. Most substitutions were tolerated in position 3 with the 
exception of the residue K. Significant decreases were also seen in position 6 upon introduction 
of either a negative charge (D, E) or a positively charged residue (R). A summary of these 
effects by different single amino acid substitutions is given in Table 5. 



TABLE 5 



Summary A2.1 



Poly-A 



[00102] The Motif-Library Approach To further evaluate the importance of non-anchor 
positions for binding, peptides of potential target molecules of viral and tumor origin were 
scanned for the presence of sequences containing optimal 2-9 anchor motifs. A set of 161 
peptides (appendix I) containing a L or M in position 2 and a V, L or I in position 9 was selected, 
synthesized and tested for binding (see Table 17). Only 1 1 .8% of these peptides bind with high 
affinity (ratio ^-10; 22.4% were intermediate binders (ratio 20.1). As many as 36% were weak 
binders (ratio O.01 - 0.0001), and 31% were non-binders (ratioO.OOOl). The high number of 
non-binders containing optimal anchor-motifs indicates that in this set of peptides positions other 
than the 2-9 anchors influence A2.1 binding capacity. Appendix 1 sets forth all of the peptides 
having the 2-9 motif used for this analysis and the binding data for those peptides. 

[00103] To define the influence of non-anchor positions more specifically, the frequency 
of occurrence of each amino acid in each of the non-anchor positions was calculated for the good 
and intermediate binders on one hand and non-binders on the other hand. Amino acids of similar 
chemical characteristic were grouped together (Table 6). Weak binders were not considered for 
the following analysis. The frequency of occurrence of each amino acid in each of the non- 
anchor positions was calculated for the good binders and non-binders (Table 6). 

[00104] Several striking trends become apparent. For example in position 1, only 3.6% of 
the A2.1 binders and as much as 35% of the non-binders carried a negative charge (residues D 
and E). This observation correlates well with previous findings in the set of poly-A analogs, 
where a D or E substitution greatly affected binding. Similarly, the residue P was 8 times more 
frequent in non-binders than in good binders. Conversely, the frequencies of aromatic residues 
(Y, F, W) were greatly increased in A2.1 binders as compared to non-binders. 
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[00105] Following this approach, amino acids of similar structural characteristics were 
grouped together. Then, the frequency of each amino acid group in each position was calculated 
for binders versus non-binders. Finally, the frequency in the binders group was divided by the 
frequency in the non-binders to obtain a "frequency ratio". (Table 7) This ratio indicates whether 
a given amino-acid or group of residues occurs in a given position preferentially in good binders 
(ratio >1) or in non-binders (ratio <1). 
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TABLE 7 



A2.1 9-mer PEPTIDES 

NUMBER OF PEPTIDES 
GOOD BINDERS 

INTERMEDIATE BINDERS 
WEAK BINDERS 
NON-BINDERS 



161 

19 11.8% 
36 22.4% 
58 36.0% 
48 29.8% 



pos. 1 


I POS. 2 


pos. 3 


ratio 


ratio 


_ ratio 


2.6 


J NA~~ 


0.9 


3.5 


NA 


0.4 


0.1 


NA 


0.0 


3.1 


NA 


0.2 


3.1 


1.0 


1.8 


7.0 


NA 


5.2 


0.5 


NA 


0.4 


0.7 


NA 


1.9 


0.1 


NA 


0.7 



no negative binders 



pos. 8 


POS. 9 


ratio 


ratio 


0.3 


NA 


0.4 


NA 


0.9 


NA 


1.3 


NA 


1.7 


1.0 


2.6 


NA 


0.3 


NA 


1.1 


NA 


+++ 


NA 
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[00106] Different Residues Influence A2.1 Binding In order to analyze the most striking 
influences of certain residues on A2.1 binding, a threshold level was set for the ratios described 
in Table 7. Residues showing a more than 4-fold greater frequency in good binders were regarded 
as preferred residues (+). Residues showing a 4-fold lower frequency in A2.1 binders than in 
non-binders were regarded as disfavored residues (-). Following this approach, residues showing 
the most prominent positive or negative effects on binding are listed in Table 8. 

[001 07] This table identifies the amino acid groups which influence binding most 
significantly in each of the non-anchor positions. In general, the most negative effects were 
observed with charged amino acids. In position 1 . negatively charged amino acids were not 
observed in good binders, i.e., those amino acids were negative binding residues at position 1. 
The opposite was true for position 6 where only basic amino acids were detrimental for binding 
i.e., were negative binding residues. Moreover, both acidic and basic amino acids were not 
observed in A2. 1 binders in positions 3 and 7. A greater than 4-fold increased frequency of non- 
binders was found when P was in position 1 . 



TABLE 8 



Summary of A2.1 Motif-Library , 9-mers 




(+) ■ Ratio > 4-fo!d 



(-) - Ratio < 0.25 



1001 08| Aromatic residues were in genera! favored in several of the „o„-a„chor positions 
pamculariy in positions !, 3, and 5. SmaU residues like S, T, and C were favored in position 4 ' 

and A was favored in position 7. 

100109] An Improved A2.1 9- mer Motif The data described above was used to derive a 
stringent A2.1 motif. This motif is based in significant part on the effects of the non-anchor 
positions 1 and 3-8. The uneven distribution of amino acids at different positions is reflective of 
specfic dominant negative binding effects of certain residues, mainly charged ones, on binding 
affimty. A series of rules were derived to identify appropriate anchor residues in positions 2 and 
9 and negative binding residues at positions 1 and 3-8 to enable selection of a high affinity 
bmding immunogenic peptide.. These rules are summarized in Table 9. 

100110] To validate the motif defined above and shown in Table 9 published sequences of 
peptides that have been naturally processed and presented by A2. 1 molecules were analyzed 
(Table 1 0). Only 9-merpeptides containing the 2-9 anchor residues were considered. 

(00111] When the frequencies of these peptides were analyzed, it was found that in general 
they followed the rules summarized in Table 9. More specifically, neither acidic amino acids nor 
P were found in position 1 . Only one acidic amino acid and no basic amino acids were found in 
position 3. Positions 6 and 7 showed no charged residues. Acidic amino acids, however were 
frequently found in position 8, where they are tolerated, according to our definition of the A2 1 
motif. The analysis of the sequences of naturally processed peptides therefore reveals that >90% 
of the peptides followed the defined rules for a complete motif. 

[001 12] Thus the data confirms a role of positions other than the anchor positions 2 and 9 
for A2.1 binding. Most of the deleterious effects on binding are induced by charged amino acids 
in non-anchor positions, i.e. negative binding residues occupying positions 1 , 3, 6 or 7 
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TABLE 9 



A2.1 Motif for 9-mer peptides 



I 



AA Position 
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TABLE 10 



A2.1 natur ally proce ssed pep tides 




A2.1 Binding of Peptide 10-mers 

[00! 131 The "Motif-Library" Approach Previous data clearly indicated that . 0-m.rs can 
also hnd to HLA molecules even if with a somewhat lower affinity than 9-mers. For this reason 
we expanded our analysis to 10-mer peptides. 

[00114] Therefore, a "Motif-Library" set of 170 peptide 10-mers containing optimal motif- 
combmations was selected from known target molecule sequences of viral and tumor origin and 
analyzed as described above for 9-mers. In this set we found 5.9% good binders, 17 l% 
intermediate binders, 41.2o/ 0 weak binders and 35.9o/o non-binders. The actual sequences origin 
and binding capacities of this set of peptides are included as Appendix 2. This set of 10-mers was 
used to determine a) the rules for 10-mer peptide binding to A2.1, b) the similarities or 
differences to rules defined for 9-mers, and c) if an insertion point can be identified that would 
allow for a superimposable common motif for 9-mers and 1 0-mers. 

[00115] Amino-acid frequencies and frequency ratios for the various amino-acid groups 
for each position were generated for 1 0-mer peptides as described above for 9-mer peptides and 
are also shown in Tables 1 1 and 12, respectively for grouped residues. 

[00116] A summary of preferred versus disfavored residues and of the rules derived for the 
10-mers in a manner analogous to that used for 9-mers, is also listed in Tables 13 and 14, 
respectively. 

[00117] When the frequency-ratios of different amino-acid groups in binders and non- 
binders at different positions were analyzed and compared to the corresponding ratios for the 9- 
mers, both striking similarities and significant differences emerged (Table 15). At the N- 
terminus and the C-termini of 9-mers and 10-mers, similarities predominate. In position 1 for 
example, in 1 0-mers again the P residue and acidic amino acids were not tolerated. In addition at 
position 1 in 10-mers aromatic residues were frequently observed in A2. 1 binders. In position 3, 
acidic amino acids were frequently associated with poor binding capacity in both 9-mers and 10- 
mers. Interestingly, however, while in position 3 aromatic residues were preferred in 9-mers, 
aliphatic residues (L, V, I, M) were preferred in 1 0-mers. 
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TABLE 13 



Summary of A2.1 Motlf-Llbrary , 10-mers 



AA position 
1 



8 



10 



(+) 



(YFW), A 
Anchor 




(YFW), (LVIM) 



Anchor 



LI 
(DE), P 



(DE), (RKH) 
(RKH) 



(+) = Ratio > 4-fold (-) = Ratio s 0.25 



TABLE 14 



A2.1 Motif for 10-mer peptides 



TABLE 15 



Comparison 



of A2.1 binding of 9-mers 

and of 10-mers 



9-mers 



1 0-mers 




9-mers 



1 0-mers 



100118] At the C-terminus of the peptides, basic amino acids are not favored in position 7, 
and both acidic and basic amino acids are not favored in position 8 for 10-mers. This is in 
striking agreement with the observation that the same pattern was found in 9-mers at positions 6 
and 7. Interestingly, again the favored residues differ between two peptide's sizes. Aromatic (Y, 
F, W) or aliphatic (L, V, I, M) residues were preferred in 10-mers at position 8, while the A 
residue was preferred by 9-mers in the corresponding position 7. 

[00119] By contrast, in the center of the peptide no similarities of frequency preferences 
were observed at positions 4, 5, and 6 in 10-mers and positions 4 and 5 in the 9-mers. 

[001 20] Most interestingly, among the residues most favored in the center of the tested 
peptides were G in position 4 and 6, P in position 5 was not observed in binders. All of these 
residues are known to dramatically influence the overall secondary structure of peptides, and in 
particular would be predicted to strongly influence the propensity of a 10-mer to adopt a "kinked" 
or "bulged" conformation. 

[00121 ] Charged residues are predominantly deleterious for binding and are frequently 
observed in non-binders of 9-mers and 1 0-mers. 

[001 22 ] However, favored residues are different for 9-mers and 1 0-mers. Glycine is 
favored while Proline is disfavored in the center of 1 0-mer peptides but this is not the case for 9- 



mers. 



1001231 These data establish the existence of an "insertion area" spanning two positions (4, 
5) in 9-mers and 3 positions (4, 5, 6) in 10-mers. This insertion area is a more permissive region 
where few residue similarities are observed between the 9-mer and 10-mer antigenic peptides. 
Furthermore, in addition to the highly conserved anchor positions 2 and 9, there are "anchor 
areas" for unfavored residues in positions 1 and 3 at the N-terminus for both 9-mer and 10-mer 
and positions 7- 1 0 or 6-9 at the C-terminus for 1 0-mers and 9-mers, respectively. 

Example 6 



Algorithm to Predict Binding of 9-mer Peptides to HLA-A2. 1 

[001 24] Within the population of potential A2. 1 binding peptides identified by the 2,9 
motif, as shown in the previous example, only a few peptides are actually good or intermediate 
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binders and thus potentially immunogenic. It is apparent from the data previously described that 
the residues present in positions other than 2 and 9 can influence, often profoundly, the binding 
affinity of a peptide. For example, acidic residues at position 1 for A2.1 peptides do not appear 
to be tolerated. Therefore, a more exact predictor of binding could be generated by taking into 
account the effects of different residues at each position of a peptide sequence, in addition to 
positions 2 and 9. 

[00125] More specifically, we have utilized the data bank obtained during the screening of 
our collection of A2.1 motif containing 9-mer peptides to develop an algorithm which assigns a 
score for each amino acid, at each position along a peptide. The score for each residue is taken 
as the ratio of the frequency of that residue in good and intermediate binders to the frequency of 
occurrence of that residue in non-binders. 

100126] In the present "Grouped Ratio" algorithm residues have been grouped by 
similarity. This avoids the problem encountered with some rare residues, such as tryptophan, 
where there are too few occurrences to obtain a statistically significant ratio. Table 16 is a listing 
of scores obtained by grouping for each of the twenty amino acids by position for 9-mer peptides' 
containing perfect 2/9 motifs. A peptide is scored in the "Grouped Ratio" algorithm as a product 
of the scores of each of its residues. In the case of positions other than 2 and 9, the scores have 
been derived using a set of peptides which contain only preferred residues in positions 2 and 9. 
To enable us to extend our "Grouped Ratio" algorithm to peptides which may have residues other 
than the preferred ones at 2 and 9, scores for 2 and 9 have been derived from a set of peptides 
which are single amino acid substitutions at positions 2 and 9. Figure 2 shows a scattergram of 
the log of relative binding plotted against "Grouped Ratio" algorithm score for our collection of 
9-mer peptides from the previous example. 
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[00127] The present "Grouped Ratio" algorithm can be used to predict a population of 
peptides with the highest occurrence of good binders. If one were to rely, for example, solely on 
a 2(L,M) and 9(L,I, and V) motif for predicting A2. 1 binding 9-mer peptides, it would have been 
predicted that all 161 peptides in our database would be good binders. In fact, as has already 
been described, only 12% of these peptides would be described as good binders and only 22% as 
intermediate binders; 66% of the peptides predicted by such a 2,9 motif are either weak or non- 
binding peptides. In contrast, using the "Grouped Ratio" algorithm described above, and 
selecting a score of 1.0 as threshold, 20 peptides were selected. Of this set, 50% are good 
binders, and 45% are intermediate, while only 5% are weak and 0% are non-binders (Table 17). 

[00128] The present example of an algorithm has used the ratio of binders/non-binders to 
measure the impact of a particular residue at each position of a peptide. It is immediately 
apparent to one of ordinary skill that there are alternative ways of creating a similar algorithm. 

[001291 An algorithm using the average binding affinity of all the peptides with a certain 
amino acid (or amino acid type) at a certain position has the advantage of including all of the 
peptides in the analysis, and not just good/intermediate binders and non-binders. Moreover, it 
gives a more quantitative measure of affinity than the simpler "Grouped Ratio" algorithm. We 
have created such an algorithm by calculating for each amino acid, by position, the average log of 
binding when that particular residue occurs in our set of 161 2,9 motif containing peptides. 
These values are shown in Table 1 8. The algorithm score for a peptide is then taken as the sum 
. of the scores by position for each residues. Figure 3 shows a scattergram of the log of relative 
binding against the average "Log of Binding" algorithm score. Table 17 shows the ability of the 
two algorithms to predict peptide binding at various levels, as a function of the cut-off score 
used. The ability of a 2,9 motif to predict binding in the same peptide set is also shown for 
reference purposes. It is clear from this comparison that both algorithms of this invention have a 
greater ability to predict populations with higher frequencies of good binders than a 2,9 motif 
alone. Differences between the "Grouped Ratio" algorithm and the "Log of Binding" algorithm 
are small in the set of peptides analyzed here, but do suggest that the "Log of Binding" algorithm 
is a better, if only slightly, predictor than the "Grouped Ratio" algorithm. 



Example 7 



Use of an Algorithm to Predict Binding of 10-mer Peptides to HLA-A2. 1 

1001301 Using the methods described in the proceeding example, an analogous set of 
algorithms has been developed for predicting the binding of 10-mer peptides. Table 19 shows 
the scores used in a "Grouped Ratio" algorithm, and Table 20 shows the "Log of Binding" 
algorithm scores, for 10-mer peptides. Table 21 shows a comparison of the application of 
different algorithmic methods to select binding peptides. Figures 4 and 5 show, respectively, 
scattergrams of a set of 1 0-mer peptides containing preferred residues in positions 2 and 1 0 as 
scored by the "Grouped Ratio" and "Log of Binding" algorithms. 

Example 8 

Binding of A2.1 Algorithm Predicted Peptides 

[00131] The results of Examples 6 and 7 indicate that an algorithm can be used to select 
peptides that bind to HLA-A2. 1 sufficiently to have a high probability of being immunogenic. 

[00132] To test this result, we tested our algorithm on a large (over 1300) non-redundant, 
independent set of peptides derived from various sources. After scoring this set with 
algorithm, we selected 41 peptides (Table 22) for synthesis, and tested them for A2. 1 binding. 
This set of peptides was comprised of 21 peptides with high algorithm scores, and 20 peptides 
with low algorithm scores. 
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100133] The binding data and categorization profile are shown in Tables 22 and 23 
respectively. The correlation between binding and algorithm score was 0.69. The striking 
difference between peptides with high algorithm scores is immediately apparent from Table 22, 
and those with low algorithm scores. Respectively, 76% of the high scorers and none of the low 
scorers were either good or intermediate binders. This data demonstrates the utility of the 
algorithm of this invention. 
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Example 9 



Ex vivo induction of Cytotoxic T Lymphocytes (CTL) 

[00134] Peripheral blood mononuclear cells (PBMC) are isolated from an HLA-typed 
patient by either venipuncture or apheresis (depending upon the initial amount of CTLp 
required), and purified by gradient centrifugation using Ficoll-Paque (Pharmacia). Typically, one 
can obtain one million PBMC for every ml of peripheral blood, or alternatively, a typical 
apheresis procedure can yield up to a total of 1-10 X 10 10 PBMC. 

[00135] The isolated and purified PBMC are co-cultured with an appropriate number of 
antigen presenting cell (APC), previously incubated ("pulsed") with an appropriate amount of 
synthetic peptide (containing the HLA binding motif and the sequence of the antigen in 
Question). PBMC are usually incubated at 1-2 X 10 6 cells/ml in culture medium such as RPMI- 
1 640 (with autologous serum or plasma) or the serum-free medium ATM-V (Gibco). 

[00136] APC are usually used at concentrations ranging from 1X10 4 to 2X10 5 cells/ml, 
depending on the type of cell used. Possible sources of APC include: 1) autologous dendritic 
cells (DC), which are isolated from PBMC and purified as described (Inaba, et al., J. Exp. Med. 
166:182 (1987)); and 2) mutant and genetically engineered mammalian cells that express 
"empty" HLA molecules (which are syngeneic [genetically identical] to the patient's allelic HLA 
form), such as the, mouse RMA-S cell line or the human T2 cell line. APC containing empty 
HLA molecules are known to be potent inducers of CTL responses, possibly because the peptide 
can associate more readily with empty MHC molecules than with MHC molecules which are 
occupied by other peptides (DeBruijn, et al., Eur. J. Immunol. 21 :2963-2970 (1991)). 

[00137] In those cases when the APC used are not autologous, the cells will have to be 
gamma irradiated with an appropriate dose (using, e.g., radioactive cesium or cobalt) to prevent 
their proliferation both ex vivo, and when the cells are re-introduced into the patients. 

[001 38] The mixture cultures, containing PBMC, APC and peptide are kept in an 
appropriate culture vessel such as plastic T-flasks, gas-permeable plastic bags, or roller bottles, at 
37' centigrade in a humid air/C0 2 inC ubator. After the activation phase of the culture, which 
usually occurs during the first 3-5 days, the resulting effector CTL can be further expanded, by 
the addition of recombinant DNA-deri ved growth factors such as interleukin-2 (IL-2), 
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interleukin-4 (IL-4), or interleukin-7 (DL-7) to the cultures. An expansion culture can be kept for 
an additional 5 to 12 days, depending on the numbers of effector CTL required for a particular 
patient. In addition, expansion cultures may be performed using hollow fiber artificial capillary 
systems (Cellco), where larger numbers of cells (up to 1X10 11 ) can be maintained. 

[00139] Before the cells are infused into the patient, they are tested for activity, viability, 
toxicity and sterility. The cytotoxic activity of the resulting CTL can be determined by a standard 
5I Cr-release assay (Biddison, W.E. 1991, Current Protocols in Immunology, p7,17.1-7.17.5, Ed. 
J. Coligan et al., J. Wiley and Sons, New York), using target cells that express the appropriate 
HLA molecule, in the presence and absence of the immunogenic peptide. Viability is determined 
by the exclusion of trypan blue dye by live cells. Cells are tested for the presence of endotoxin 
by conventional techniques. Finally, the presence of bacterial or fungal contamination is 
determined by appropriate microbiological methods (chocolate agar, etc.). Once the cells pass all 
quality control and safety tests, they are washed and placed in the appropriate infusion solution 
(Ringer/glucose lactate) and infused intravenously into the patient. 

Example 10 

Assays for CTL Activity 

[00140] 1 . Peptide synthesis: Peptide syntheses were carried out by sequential 
coupling of N-a-Fmoc-protected amino acids on an Applied Biosystems (Foster City, CA) 430A 

peptide synthesizer using standard Fmoc coupling cycles (software version 1.40). All amino 
acids, reagents, and resins were obtained from Applied Biosystems or Bachem. Solvents were 
obtained from Burdick & Jackson. Solid-phase synthesis was started from an appropriately 
substituted Fmoc-amino acid-Sasrin resin. The loading of the starting resin was 0.5-0.7 mmol/g 
polystyrene, and 0.1 or 0.25 meq were used in each synthesis. A typical reaction cycle proceeded 
as follows: 1) The.N-terminal Fmoc group was removed with 25% piperidine in 
dimethylformamide (DMF) for 5 minutes, followed by another treatment with 25% piperdine in 
DMF for 15 minutes. The resin was washed 5 times with DMF. An N-methylpyrolidone (NMP) 
solution of a 4 to 10 fold excess of a pre-formed l-hydroxybenzotriazole ester of the appropriate 
Fmoc-amino acid was added to the resin and the mixture was allowed to react for 30-90 min. 
■ The resin was washed with DMF in preparation for the next elongation cycle. The fully 
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protected, resin bound peptide was subjected to a piperidine cycle to remove the terminal Fmoc 
group. The product was washed with dichloromethane and dried. The resin was then treated 
with trifluoroacetic acid in the presence of appropriate scavengers [e.g. 5% (v/v) water] for 60 
minutes at 20°C. After evaporation of excess trifluoroacetic acid, the crude peptide was washed 
with dimethyl ether, dissolved in water and lyophilized. The peptides wee purified to >95% 
homogeneity by reverse-phase HPLC using H2O/CH3CN gradients containing 0.2% TFA 
modifier on a Vydac, 300A pore-size, C-18 preparative column. The purity of the synthetic 
peptides was assayed on an analytical reverse-phase column, and their composition ascertained 
by amino acid analysis and/or sequencing. Peptides were routinely dissolved in DMSO at the 
concentration of 20 mg/ml. 

[00141] 2. Media: RPMI-1640 containing 10% fetal calf serum (FCS) 2 mM 
Glutamine, 50 jig/ml Gentamicin and 5X10" 5 M 2-mercaptoethanol served as culture medium and 
will be referred to as R10 medium. 

[00142] RPMI-1640 containing 25 mM Hepes buffer and supplemented with 2% FCS was 
used as cell washing medium. 

[00143] 3. Rat Concanavalin A supernatant: The spleen cells obtained from Lewis 
rats (Sprague-Dawley) were resuspended at a concentration of 510 6 cells/ml in R10 medium 
supplemented with 5 /ig/ml of ConA in 75 cm2 tissue culture flasks. After 48 hr at 37°C, the 
supernatants were collected, supplemented with 1% a-methyl-D-mannoside and filter sterilized 
(.45 fim filter). Aliquots were stored frozen at -20°C. 

[00144] 4. LPS-activated lymphoblasts: Murine splenocytes were resuspended at a 
concentration of l-1.5xl0 6 /nil in R10 medium supplemented with 25 jig/ml LPS and 7 /tg/ml 
dextran sulfate in 75 cm 2 tissue culture flasks. After 72 hours at 37°C, the lymphoblasts were 
collected for use by centrifugation. 

[00145] 5. Peptide coating of lymphoblasts: Coating of the LPS activated 
lymphoblasts was achieved by incubating 30xl0 6 lymphoblasts with 100 ng of peptide in 1 ml of 
R10 medium for 1 hr at 37°C. Cells were then washed once and resuspended in R10 medium at 
the desired concentration for use in in vitro CTL activation. 

[00146] 6. Peptide coating of Jurkat A27K b cells: Peptide coating was achieved by 
incubating 10xlO 6 irradiated 20,000 rads) Jurkat A2.1/K b cells with 20 fig of peptide in 1 ml of 
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RIO medium for 1 hour at 37*C Cells were washed three times and resuspended at the required 
concentrati on in R 1 0 medium. 



[00147] 7. In Vitro CTL activation: One to four weeks after priming spleen cells (5 x 
10 6 cells/well or 30xl0 6 cells/T25 flask) were concultured at 37°C with syngeneic, irradiated 
(3,000 rads), peptide coated lymphoblasts (2xl0 6 cells/well or 1OX10 6 cells/T25 flask) in RIO 
medium to give a final volume of 2 ml in 24-well plates or 10 ml in T25 flasks. 

[00148] 8. Restimulation of effector cells: Seven'to ten days after the initial in vitro 
activation, described in paragraph 8 above, a portion of the effector cells were restimulated with 
irradiated (20,000 rads), peptide-coated Jurkat A2/K b cells (0.2xlO 6 cells/well) in the presence of 
3xl0 6 "feeder cells'/well (C57B1/6 irradiated spleen cells) in RIO medium supplemented with 
5% rat ConA supernatant to help provide all of the cytokines needed for optimal effector cell 
growth. 

[00149] 9. Assay for cytotoxic activity: -Target cells (3x10*) were incubated at 37*C 
in the presence of 200 jd of sodium 51 Cr chromate. After 60 minutes, cells were washed three 
times and resuspended in R10 medium. Peptide 875.15 was added at the required concentration. 
For the assay, 10 4 51 Cr-labeled target cells wee added to different concentrations of effector cells 
(final volume of 200 jil) in U-bottom 96-231 1 plates. After a 6-hour incubation period at 37°C, 
0.1 ml aliquots of supernatant were removed from each well and radioactivity was determined in 
a Micromedic automatic gamma counter. The percent specific lysis was determined by the 
formula: percent specific release = 1 00x(experimental release - spontaneous release)/(maximum 
release - spontaneous release). Where peptide titrations wee performed, the antigenicity of a 
given peptide (for comparison purposes) was expressed as the peptide concentration required to 
induce 40% specific 51 Cr release at a given E:T. 

[00150] Transgenic mice were injected subcutaneously in the base of the tail with an 
incomplete Freund's adjuvant emulsion containing 50 nM of the putative CTL epitopes 
containing the A2.1 motifs, and 50 nM of the hepatitis B core T helper epitope, Cytel No. 875.23. 
Eight to 20 days later, animals were sacrificed and spleen cells were restimulated in vitro with 
syngeneic LPS lymphoblasts coated with the purative CTL epitope. A source of JL-2 (rat con A 
supernatant) was added at day 6 of the assay to a final concentration of 5% and CTL activity was 
measured on day 7. The capacity of these effector T cells to lyse peptide-coated target cells that 
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express the A2 KB molecule (Jurkat A2 KB) was measured as lytic units. The results are 
presented in Table 23. 

f00151] The results of this experiment indicate that those peptides having a binding of at 
least 0.01 are capable of inducing CTL. All of the peptides in Appendices 1 and 2 having a 
binding of at least about 0.01 would be immunogenic. 
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Example 1 1 

[00152J Class I antigen isolation was carried out as described in the parent applications. 
Naturally processed peptides were then Isolated and sequenced as described there. An allele- 
specific motif and algorithms were determined and quantitative binding assays were carried. 

[00153] Using the motifs identified above for HLA-A2.1 allele amino acid sequences from 
a tumor-related proteins, Melanoma Antigen- 1,-2, and -3 (MAGE-1, -2, and -3), were analyzed 
for the presence of these motifs. Sequences for the target antigen are obtained from the GenBank 
data base (Release No. 71.0; 3/92). The identification of motifs is done using the 
"FINDPATTERNS" program (Devereux et al., Nucleic Acids Research 12:387-395 (1984)). 

[001 54] Other viral and tumor-related proteins can also be analyzed for the presence of 
these motifs. The amino acid sequence or the nucleotide sequence encoding products is obtained 
from the GenBank database in the cases of Human Papilloma Virus (HPV), Prostate Specific 
antigen (PSA), p53 oncogene, Epstein Barr Nuclear Antigen- 1 (EBNA-1), and c-erb2 oncogene 
(also called HER-2/neu). 

[00155] In the cases of Hepatitis B Virus (HBV), Hepatitis C Virus (HCV), and Human 
Immunodeficiency Virus (HTV) several strains/isolates exist and many sequences have been 
placed in GenBank. 

[00156] For HBV, binding motifs are identified for the adr, adw and ayw types. In order 
to avoid replication of identical sequences, all of the adr motifs and only those motifs from adw 
and ayw that are not present in adr are added to the list of peptides. 

[00157] In the case of HCV, a consensus sequence from residue 1 to residue 782 is derived 
from 9 viral isolates. Motifs are identified on those regions that have no or very little (one 
residue) variation between the 9 isolates. The sequences of residues 783 to 3010 from 5 viral 
isolates were also analyzed. Motifs common to all the isolates are identified and added to the 
peptide list. 

[00158] Finally, a consensus sequence for HTV type 1 for North American viral isolates 
(10-12 viruses) was obtained from the Los Alamos National Laboratory database (May 1991 
release) and analyzed in order to identify motifs that are constant throughout most viral isolates. 



Motifs that bear a small degree of variation (one residue, in 2 forms) were also added to the 
peptide list. 

1001521 Table 1 4 provides the results of searches of the following antigens cERB2, 
EBNA1, HBV, HCV, HTV, HPV, MAGE, p53, and PSA. Only peptides with binding affinity of 
at least 1% as compared to the standard peptide in assays described in Example 5 are presented. 
Binding as compared to the standard peptide is shown in the far right column. The column 
labeled "Pos." indicates the position in the antigenic protein at which the sequence occurs. 

[00160] Tables 1 5 and 1 6 provide the results of these searches. Binding affinities are 
expressed as percentage of binding compared to standard peptide in the assays as described in the 
parent applications are presented. 

1001611 Although the present invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be apparent that certain 
changes and modifications may be practiced within the scope of the appended claims. 
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Table 14 



Peptide 


Sequence 


AA 


Virus 


Strain 


Molecule 


Po*. 


A2.1 


1.0641 


| 1LSPFLPLL 


9 


HBV 


adr 


ENV | 371 


2-9 


1.0240 


TUQDiVLHL 


9 


• HPV 


18 


E7 1 7 


0J6 


l.Offlfl 


VVL5LLVPFV 


9 


HBV 


adr 


ENV 


335 


072 


1.0851 


FLLSLCtHL 


9 


Hbv 


mdr 


POL 


1147 


052 




QLFEDNYAL 


9 


C-EKB2 






106 


0.46 


1.0S14 


LMVTVYYGV 


9 


HIV 




ENV 


2182 


0.44 


1.0378 


MMWFWGTSL 


9 


HBV 


ndw 


ENV 


^0 

AM 


0.41 


1.0839 


MMWYWOPSL 


9 


HBV 


adr 


ENV 


360 


a4t 


1.0384 


FLTKQYLNL 


9 


HBV 


adw 


POL 


1279 


029 


1.0321 


LLHNCAY5L 


9 


C-ERB2 




435 


0.21 


1.0834 


LLLCLrFLL 


9 


HBV 




ENV 


220 


0.19 


1.0167 


CLY5STVPV - 


9 


HBV 


adr 


POL 


635 


ais 


1.0849 


HLY5HPEL 


9 


HBV 


adr 


POL 


1076 


ai3 


1.0275 


KMTEAAPPV 


9 1 p53 






65 


an 


1.0654 


LLMGTLGrV 


9 | HPV 


16 


E7 


fi2 


an 


1.C880 


| ILSTTMPLL 


9 


HBV 


adw 


ENV 


371 


0.1] 


1.0127 


YLVAYQATV 


9 


HCV 




LOR? 


1585 


an 


1.0151 


VLLDYQGML 


V 
9 


HBV 


odr 


ENV 


259 


0.11 


1.0018 


VLAEAMSQV 


inv 




GAG 


367 


o.n 


1.0330 




9 


C-EK32 






639 


0.091 


1.0209 


SLYAVSPSV 


9 


HBV 


adr 


tol 


1388 


0.078 


1.0816 


DLMCY1TLV 


9 


HCV 




COKE 


132 


0.G55 


1.083d 


LLCLIFLLV 


9 


HBV 


adr 


ENV 


251 
1250 


0.W9 


1.0&52 


FLCQQYLHL 


9 


. HBV 


adr 


POL 


0.048 




NLYVSLMLL 


9 


HBV 


adw 


POL 


1088 
263 




l.UfiJ/ 


GMIPVCPLL 


1 9 


HBV 


adr 


ENV 


0.O46 


1 HAT<3 


ILPCSFTTL 


9 


HCV 




NS1/ENV2 


676 


0.045 


i .\JL\jy 


AL5TCLJHL 


9 HCV 




NSI/ENV2 686 


a042 




ILLLCLIFL 


9 


HBV 


adr 


ENV 


249 


a035 


1.0331 




9 


0-BRB2 






48 


a034 
a034 


1 0337 


UL lb I VQJLV 


9 


o-ERH2 






789 


1 0342 




9 
9 


HBV 


adr ENV 


377 


0.031 
0.031 




AT T t 


C-ERB2 






5 




V L> I^JSIN 1 L 


9 








153 
177 


0.029 


1 Gfl3S 




9 


HBV 


adr ENV 


0.024 


1.0844 




9 


HBV 


adr I 


CORE 


490 


aQ24 


1.0135 


1 Lnu I UAu V 


9 


HCV 




LORF 


1851 


0.QU 


1.087U 


or Mpvrn t 


9 


<sERB2 






799 


0.Q23 


1.0CF5 


t t wvr cr* a vf 

LL#M Wo EVaA Y 


9 


HIV 




POL 


1496 




1,0873 


r LL»L« 1 rvti 


9 


HBV 


adw 


ENV 


204 


0.021 


1.0323 


AUHHNTHL 


9 oERB2 






466 


0.021 


1.0859 


VLVIiPQWVL 


9 | PSA 






49 


_ 0.020 

"aot9 


1.0267 


KLQCVDLHV 


9 PSA 






166 


1.0S20 




9 


HCV 




NSI/ENV2 


£76 


aoi7 


1.0111 


HLHQNIVDV 


9 


HCV 




NS1/ENV2 


693 


0.016 


1.0103 


SMVGNWAKV 


9 


HCV 




ENV1 


364 


aoi6 


1.Q2S3 


LLCKNSFEV 


9 


* P53 




264 


aou 


1.0207 


CLYRPLLSL 


9 


HBV 


■dr 


POL 


1370 


0014 


1.0389 


CLYXPLLRL 


9 


HBV 




POL 


1399 


O014 


1.018S 


NLSWLSLDV 


9 


HBV 


adr 


POL 


996 


0XH3 


1.0113 


FLLLADARV 


9 


HCV 




NS1/ENV2 


725 


0.013 


1.0119 


YLVTRHADV 


9 


HCV 




LORF 


1131 


om\ 



Table 14 




1.0092 



Table 14 



Peptide 



1.0108 



1.G2W 



Sequ 



TLPA1STCL 



ALAIPQCRL 



1.0101 



DLCCSVH.V 



1.0231 | RLCVQSTHV 



1.0162 



1.0126 



1.0163 
1.0130 



i.00t2 



1.0C69 



1.0C9I 



1.0093 



1.0154 



LXJDDEACPL 



CLKRFUFL 



CLPVCQDHL 



PUEHHLPRL 



TLLYRLCAV 



EUHNREiL 



ELQAIHLAL 



NLGKVIDTL 



PLGGAAKAL 



1.0173 



1.017V 



l.CGSft 



1.029* 



DLLDTASAL 



QUCQSRLCU 



CLQPQQGSL 



PLDGEYPTL 



VLKDAECDL 



1.0310 



1.00C7 



1.0C37 



1.0070 



1.UI57 



QLCYQDTIL 



DLNTMLNTV 



ELHPOKVYTV 



ELKXIICQV 



1.0160 



ELWSYVNV 



CLTPCRETV 



1.0164 



1.0667 



1.0159 



OLNLGNLNv 



LLVWLCW 



1.0322 



NMGLKRQL 



SLRBLGSGL 



i.osso 



DLLEXCERL 



1.0332 | DLVDAEEYL 



1.0366 



fLEEELPIIL 



1.CB72 



1.0390 



l.CSll 



1.C812 
1.C82 



1.CSH7 



1.0316 



DLQHCRLVL 



TLFCPtCAL 



LLTQICCTT. 



PLVKLWYQL 
FLFTLLLCL 



NLrVSLLU 



1.0M2 
l.OStf 



1.0356 



FLQPBQIQV 



DLAARNVLV 
VtVKSPiVHV 
TLSPCKNGV 



1.0376 



1.0363 



DLSWtSLDV 
KMEIsnASGL 



1.0195 | TLPQEH1VL 



1.0196 



1.0201 



1.0210 



1.0220 



KLKQCFRKT, 



PLPIHTAEL 
QLDPARDVl 



vlt^crhkl 



1.0229 



1.0245 



1.0266 



DLQ^kl 1DL 
ALEAQQEAL 



DUTQEPAL 
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Table 14 



Peptide 


Sequence 


AA 


Virus 


Strain 


Molecule 


Pew. 


A2.1 




1.0690 


LLFN1LGGWV 


io i 


HCV 




LORF 


1807 


35 




1.0930 


LLVl'rVQWfV 


10 


H3V 




ENV 


333 


1.6 




1.CS84 


LLALLSCLTV 


10 


HCV 




CORE | 


173 


0.61 




1.0695 


TLLLCLrFLL 


10 


HBV 


*dr 


ENV | 


249 


0JO 




1.0518 


GLSPTVWLSV 


10 


ITHV 


adr 


ENV 


343 


0J8 




1.0902 


SLYNILSPFt. 


to 


HBV 


odr 


ENV 


367 


0^3 




1.0692 


LLVLQAGFrL 


To""" 


H3Y 


•dr 


ENV | 


175 


0-21 




1.C6R6 


FUQTH1FAEV 


10 


EHKA1 




1 W 


n 1*7 
LLl/ 




1.CK23 


QLFLNTLSFV 


10 


HIV 


18 


E7 | 


£3 


U. & 1 




1.0904 


LLPIFFCLWV 


• * m 

10 


HBV 




ENV 


378 


n in 




1.0S7 


LLLCUH-LV | 


10 


HBV 


*dr 


ENV 


250 


n i iM 




1.0516 


LLDYQCMLPV | 


10 


HBV 


odr 


ENV 


260 


u.uso 




1.0901 


WMMVVYWGPSL | 


10 t 


HBV 


adr 


ENV 


359 


n nai 




j 1.033 


GLYSSTVFVL [ 


10 | 


HBV 


a«Jr 


POL I 


635 


n ftxii 

U.VJOV 




1.0469 


YLTJPRRCPRL 1 


10 | 


• HCV 




CORE 


35 


U.U/J 




1.CS83 


"gllgcutsl 1 


10 


HCV 




LORF 


1036 


n rvi.1 






1.CS07 


ILCWGELMNL 


10 


HBV 


adr 


COKR 


4/. 9 


(]j052 






1.0927 


LLC1CLTSTV 


10 


C-ERB2 






7Ho 








1.0452 


llwkgf-cavv j 


10 


HtV 


i 


pot. 


1496 


0.U36 






1.CWR5 


LLALLSCLT1 | 


10 


HCV 


| COKE 


17a 


M tYlA 






1.0620 


KLTNTGLYNL 


10 


iirv 


15 


E6 


52 








RT.TVFPDLGV 


10 


HCV 




LORF 




01332 






1.0659 


FLTL'XXLQCV 


10 








151 


0.031 
* '- ■ ■ 






1.0932 


WMMWFWGl*SL 


10 


HBV 


idw 


ENV 


3a9 








1.0772 


SLNh'LGGTVV | 


10 


HBV 


adw 


ENV 


2ul 


ft fYJT 






1.06U9 


SLQDiETTCV | 10 


H?V 


IS 


E6 


in 








1.CI525 


ILS-LLPhilV 


1 io 


H3V 


adr 


CORE 


529 


0035 






1.0508 


RLHCLSAF5L 


1 io 


HCV 




LORP 


2585 


0.020 






1.0493 


ILGGWVAAQL 


10 


HCV 




LORF 


1511 


0.018 




1.0736 


VMACVC5FYV 


10 


C-EKB2 






773 


0.01 d 
0.017 






i.ftsso 


QLMVTVYYCV 


1 10 


K1V 




1 ENV 


21fil 






1.0573 


' ilkctsfvyv 


10 


HBV 


• dr 


tol 


1345 


0.016 




1.07U3 


SLTEILKGCV 


10 




1 


144 


0.015 






1.0912 _ 


LLCCAANVYIL 


io" 


Trrjvr 

rtoV 


adr 


| VOL 


1337 


0.014 






" 1.0793 


A LP PAS PSA V 


10 


HBV 


gdw 


I 


1433 


0.013 




1.0903 


QLLWFHTSO. 


10 


v rm r 
I LBV 


adr 


CORE 


439 


aoi3 


1.0677 


NLLGKNSFEV 


10 


pS 






263 


OJ313 


1.CS&9 


VLAALAAYCL | 10 


MTV 




LORF 


1666 


0.011 




1.052* 


LLLDDEAGPL 


10 


ruv 


•dr 


| POL 


536 


aoi i 
0J»38" 






1.0500 


1MAKNBVFCV 


10 


urv 


LORF 


2553 






1.0492 


VLVCCVLAAL 


10 


nv.* 


LORF 


1661 


0.0034 
0.CO75 j 




1.0898 


LLCUFLLVL 


10 


1 1DV 


adr 


ENV 


251 


1.0458 
1.0459 


KLMVTVYYGV 


10 


HTV 




ENV 


2181 


0i»69 


NLMVTVrrGV 


10_ 


HTV 




ENV 


2181 


0J3067 




1.0530 


CLSPTVWLSA 


"io 


HBV 


adw 


ENV 


348 


0.0067 




1.0739 


SLPTHDPSPL 


10 


C-KRB2 






11Q0 


0.0059 






1.0419 


VLPEKDSVVTV 


10 


HIV 




POL 


940 


a0Q56 






i 1.0666 


FLHSCTAKSV 


■ "To 


P* 






113 


OJUtDO 




1.0473 


GL1HLHQN1V 


10 


HCV 




N51/ENV2 


690 


a0047 




1.0792 


I SLYAAVTNfL 


10 


HBV 


adw 


POt. 


1163 


0J0O46 




" 10780 | IMI'ARFYPNV 


10 


HBV 


■dw 


POL 


713 


0.0013 






1.0507 1 YLTRDFTTPL 


1 10 


HCV 




LORF 


2303 





Table 14 



Peptide 
1.0914 



1.0619 



1.0561 



1.0788 
1.U753 



1.0568 



1.0642 



1.0582 



1.0713 
l.U74i 



1.0512 
1.0676 



1.0719 



1.0627 



1.072 5 
1.0918 



1.049 9 
1.0559 



1.0632 



1.0451 



1.0470 



1.0397 



1.0446 



1.QKM 



1.0521 



1.Q5S3 



1.0652 



1.C716 



1.0723 



1.0497 



Sequence 
CLYNLURCL 



AA 

10 



YLHYGRCRTV | io 



SLFTSTTNPL 
NLLSSDLSWL 



RMARDPQRFV 



RMRGTFWTL 



5LQLVFGEDV 



10 



10 



10 



10 



10 



KLLHKRTLCL | 10 



CLCMEHLRBV 10 



C^SYLEDVRL | 10 



1.0549 | NLLSSNLSWL 
UV465 QLTVWGIKQL 
1.0524 VLEYLVSFGV 



l.O^a VLN1SVAATI 
1.0548 I " SLTMLLS5NL 



10 



to 



10 



10 



10 



ALLDPRVRGL I 10 



TLEDSSGNLL I 10 



TLQCLGISWL I 10 



DLRAFQQLFL | 10 



VLQGLFREYV | 10 



DL PFWFPPMV | 1 0 
DLSDCSWSP/ | 10 



CtAFSYMDDV | 10 



PLVLCTLEEV 



1.0520 NLATWVCSNL 
l.tettT NLLTQIGCTL 
1.0458 CLT1-EIDAHPL 



10 



10 



10 



to 



1.0733 VLGSCAFGTV 
1.0434 QLIKKBKVYL 



10 



10 



KLLWKGEGAV | 10 



SMVCNWAKVL | 10 



1.0570 KLICTDNSW | 10 
1.0924 | rLLWVLGW | " jp" 

10* 



LLDTCADDTV 



HLKTAVQMAV 



DLLMGTLGtV 



1.0443 LLKLACRWFV 



1.0*61 DLMVTVYYCV 



10 



1U 



10 



10 



1.0619 | TLEKLTNTCL | 10 
h07S7 !' SLTNLLSSOL | 10 



NLEDPASBEL 



CLSAMS7TDL 



VLVASRGRAV 



DLSVPQNLQV 



QLfTWPHQAL 



10 



10 



10 



in 



10 



10 



1.0727 PLTStlSAW 

1.0479 1 YUCt'SSGGi'L' | 10 



QLTCEPEPDV 



1.0523 CLTFCRBTVL 

1.0603 TLEDLLMCTL 

1. 0631 ' SLHOCPEEAL' 

l"0660 EMFKELNEAL 



10 



10 



_I0 
10 

lo 



Vina 
HIV 



MACE 



HBV 



HBV 



HBV 



WAGS 



HBV 



C-BRS2 



0-ERB2 



H3V 



HiV 



HBV 



HCV 



HBV 



HHV 



^53 



C-EKB2 



HPV 



C-ERB2 



EBNA1 



HCV 



HBV 



MACS 



HBV 



HIV 



HCV 



C-ERB2 



HIV 



HIV 



HCV 



HBV 



C-ERB2 



HTV 



hnv 



HPV 



HIV 



HIV 



HPV 



HBV 



HBV 



HBV 



PSA 



C-EKB2 



C-ERB2 



C-ERB2 



HCV 



HCV 



HBV 



HPV 

mace" 
_J2» 



Strain 
IT 



*dr 



adr 



18 



adr 



1 



*dr 



adr 



16 



18 



adw 



idr 



adr 



16 



Molecule 
E6 



97 



Ail 

16 



• II tc 



248 



POT. 



POL 



POL 



1139 oxxrw 



1020 0.0032 
978 OjOCEO 



12S8 0.0Q20 



150 0. 



1509 *0.0O19 



[ 344 0,0017 
| ©2 I 0.UO17 



cdr | TOL | 991 0.0016 



ENV 



CORE 



LQRF 



POL 



2760 aoois 



505 0.0015 



1253 0.0015 



988 0.0O14 



ENV 



E7 



LORF 



POL 



119 OjOOII 



256 aooii 



CORK 



FOL 



LOR? 



POL 



POL 



ENV1 



POL 



POL 



roL 



E7 



l*OL 



ENV 



E6 



POL 



1017 



CORE | 465 



LORr 



LORF 



1517 



36 



421 



4*4 



650 



1160 



2159 



CORE | 497 



E7 



I 78 



339 



444 0.0011 



82 aooio 



546 0.0009 



6 05 0,0009 
2399 0,0008 
1118 0.0006 



37 0.0008 



457 0.0008 



634 0.0007 
1564 0.0007 
725 I OJ30G7 



1219 0.0006 



1495 O.O0O6 



364 0.0006 



1317 0.0006 

6 61 0.0006" 

619 | 0.0005 

1426 a0005 



51 0.0005 



1356 0.0004 



21SI OiXXH 
89 CLO0C4 



OJ0004 



0J 



III I fC 



0.0003 



0.0003 



OXXXB 



0J 
0.0002 



0.U0U2 
0D0Q2 



00002 
O XX Xtt 
0.00O2 
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Peptide 


Sequence 




Virus 


Strain 


Molecule 


Poe. 


K *\ H 

A2.1 


1.06S9 


VLKDAUCDLV ! 10 


EUNA1 






574 


0JD0Q2 


1.0757 


DLVDAEEYLV | 10 


C-ER32 






1016 


0.0002 


1.0796 


RMRCTFV5FL 


10 


HSV 




POL 


1317 


0.0002 




QtAKTCPVQL 


10 








136 


0XXXJI 


1.0717 
"l.072l 


NLQVTRGRIL 


10 


C-ER32 


1 

1- 


427 


O.OGul 


WLGLRSLREL 


10 

* • ^ *• M 

10 




t _. 




aoooi 


1.0S22 


NMCIXERQLL 


Hnv 


id: | COKE 




n 
w 


1.0527 


FLSYQHFRKL 


10 


HSV 


%dr j POL 


576 


0 


1.0S29 


ELTKJLADEGL 


10 


HSV 


idr 


POL 


598 


0 


1.0531 


GLNRRVABDL 


10 


HSV 


tdr 


roL 


606 


0 


1.0536 


PLTVNEKRRL 


10 


HHV 




POL 


672, 


0 


1.0539 


IMPARFYPNI. 


10 


HHV 


icfc- | POL 


684 


0 


1.0550 


PLHPAAMPHL 


10 


HUV. 


ad.* 


POL 


1012 


0 


1.0552 


DLHDSCSKNL 


10 


HHV 


•dr 


POL 


1051 


_J3 

0 


1.0555 


LJLYKTFCRXL 


10 


HUV 


adr | POL 


1066 


1.QS57 


PMCVCL5PFL 


10 


H3V 


ad.- | POL 


1090 0 


1.0560 


VLCAKSVQHL 


10 


hhv 


adr | POL 


1123 0 


1.0569 


PLPIHTAELL 


10 


H3V 


ad: 1 POL 


1296 0 


1.0579 


PLPSLAFSAV 


10 


HUV 


ad.- | X 


1454 0 


" 1.0585 DLEAYFKTXX 


10 


HSV 


«&• 1 X 


1525 0 


1.0557 


ELCEEIRLKV 


10 HHV 


adr 


X 


1540 0 


1.0539 


VLCGCRHXLV 


10 

' 10 " 


H3V | ucr 


X 


1551 0 


1.0597 
1.0606 


TLEQQYNKPL 


H?V 


16 
15 


E6 


94 0 


DLCTZLNTSL 
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